


Annual 


APPENDIX  E 


THESIS 



•  AFOSR-TR*  on  0 6  9 gl 

I.  INTRODUCTION 


Critical  to  the  success  of  the  Air  Force  Office  of  Scientific  Research  (AFOSR) 
mission  is  the  ability  of  AFOSR  to  draw  upon  the  research  community  in  the  United 
States to respond to its needs. In recent years, however, the number of U.S. citizens
seeking  advanced  degrees  in  the  areas  of  Air  Force  research  interests  has  been  decreasing. 
This  refers  specifically  to  the  number  of  U.  S.  citizens  obtaining  Ph.D.  degrees  in  areas 
of  mathematics  and  science  that  are  of  interest  to  the  Air  Force.  This  situation  points 
toward  the  potential  problem  of  a  future  shortage  of  qualified  researchers  in  areas  critical 
to  the  nation’s  security  interest. 

To  address  this  problem,  the  United  States  Air  Force  Laboratory  Graduate 
Fellowship Program (USAF/LGFP) was established. The contract is funded under the Air
Force Systems Command by the AFOSR. The program annually provides three-year
fellowships for at least 25 Ph.D. students in research areas of interest to the Air Force.
Universal  Energy  Systems,  Inc.  (UES)  has  completed  the  third  year  of  the  three-year  LGF 
program  contract. 

This  report,  prepared  in  compliance  with  contractual  requirements,  covers  the  third 
year  of  the  program  which  now  sponsors  27  first-year  participants  as  well  as  25 
second-year fellows and 22 third-year fellows for a total of 74 active fellowships. The
report  addresses  an  overview  of  the  administration  tasks,  statistics  on  the  1989  awards, 
profiles  of  all  the  fellows,  and  summarized  results  of  the  evaluation  process.  Materials 
deemed inappropriate for inclusion in the main body of the report, such as samples of
forms,  complete  questionnaire  results,  etc.,  are  included  in  the  appendices. 

II.  ADMINISTRATION 

The  administration  of  the  LGF  program  is  conducted  from  the  Dayton  offices  of 
UES.  The  staff  consists  of  Mr.  Rodney  C.  Darrah,  Program  Manager;  Ms.  Judy  Conover, 
Program  Administrator;  and  support  personnel.  Most  members  of  the  1989  program 
administration  team  have  been  involved  with  the  project  since  award  of  the  contract  to 
UES.  This  element  of  an  experienced,  stable  staff  ensures  program  continuity  and 
contributes  to  successful  operation  of  administrative  tasks. 

The  primary  tasks  in  managing  the  program  consist  of  advertising  (which  includes 
compiling  and  updating  a  mailing  list,  and  preparing  and  distributing  ads,  flyers,  and 




DISSERTATION 


ELECTROMAGNETIC  EXCITATIONS  ON  ANTIFERROMAGNETS: 
SURFACE POLARITONS AND LEAKY WAVES


Submitted  by 
Robert  L.  Stamps 
Physics  Department 


In  partial  fulfillment  of  the  requirements 
for  the  Degree  of  Doctor  of  Philosophy 
Colorado  State  University 
Fort  Collins,  Colorado 
Fall  1988 


COLORADO  STATE  UNIVERSITY 


Fall 1988

WE  HEREBY  RECOMMEND  THAT  THE  DISSERTATION  PREPARED 
UNDER  OUR  SUPERVISION  BY  ROBERT  L.  STAMPS  ENTITLED 
ELECTROMAGNETIC  SURFACE  EXCITATIONS  ON  ANTIFERROMAGNETS  BE 
ACCEPTED  AS  FULFILLING  IN  PART  REQUIREMENTS  FOR  THE  DEGREE  OF 
DOCTOR  OF  PHILOSOPHY. 


Adviser 


Department Head


ABSTRACT OF DISSERTATION

ELECTROMAGNETIC EXCITATIONS ON ANTIFERROMAGNETS:
SURFACE POLARITONS AND LEAKY WAVES


Surface  polaritons  exist  at  frequencies  between  250  GHz  and  a  few  THz  when 
propagating  on  antiferromagnets.  They  have  been  predicted  theoretically  and  observed 
recently  on  MnF2  in  reflection  measurements.  Although  damping  is  necessary  for  coupling 
between the surface polariton modes and incident light waves, theoretical studies have not
examined  the  effect  of  damping  on  the  surface  polariton  modes.  An  analysis  of  the  surface 
polariton  modes,  with  damping,  is  done  both  by  solving  the  dispersion  equation  and  by 
studying  the  classical  electromagnetic  Green's  functions  for  a  semi-infinite  antiferromagnet. 
In addition to modifying the surface polariton dispersion curves, damping also allows for the
existence  of  new  surface  modes  in  regions  otherwise  forbidden  to  surface  wave  propagation. 
These  new  modes  are  analogous  to  the  Brewster  and  evanescent  modes  found  for 
electromagnetic  surface  waves  on  metals  and  dielectrics.  The  Green's  functions  are  also 
applied  to  the  problem  of  scattering  light  off  of  a  rough  surface  antiferromagnet  and  are  used 
to  show  that  surface  roughness  allows  the  incident  wave  to  excite  surface  polaritons  and 
leaky  modes.  Surface  roughness  is  found  to  enhance  nonreciprocal  reflectance  (a  reflectance 
which is different for incidence at angle θ and -θ). It has been argued that nonreciprocal
reflection  cannot  exist  without  nonreciprocal  absorption  processes  in  the  material. 




Depending  on  the  polarization  of  the  incident  wave,  however,  it  is  shown  that  nonreciprocal 
reflection  can  exist  without  absorption.  In  an  example  calculation,  the  reflectance  of  an 
antiferromagnet is found to be highly nonreciprocal when the illuminating beam is
circularly  polarized. 


Robert  L.  Stamps 
Physics  Department 
Colorado  State  University 
Fort Collins, CO 80531
Fall  1988 


ACKNOWLEDGMENTS 


I  gratefully  acknowledge  the  Physics  Department  of  the  University  of  Colorado  at 
Colorado  Springs,  where  much  of  this  work  was  done,  for  generously  allowing  me  the  use 
of  their  facilities.  I  also  want  to  thank  the  Physics  Department  of  Colorado  State  University 
for  allowing  me  the  freedom  to  work  closely  with  the  University  of  Colorado.  I  am 
especially  grateful  to  Professor  Carl  E.  Patton  for  his  assistance  and  support,  and  to  the 
members  of  his  magnetics  group  whose  assistance  and  knowledge  made  my  time  at  Colorado 
State  enjoyable  and  rewarding.  Most  especially,  I  want  to  thank  Professor  R.  E.  Camley, 
University  of  Colorado,  for  the  opportunity  to  participate  in  these  projects.  I  thank  him  for 
his patience, his guidance, and his unwavering encouragement and enthusiasm.


TABLE  OF  CONTENTS 


ABSTRACT
ACKNOWLEDGMENTS
TABLE OF CONTENTS

CHAPTER

1  INTRODUCTION

2  MAGNETIC EXCITATIONS AND POLARITONS
   2.1  Magnetic Interactions
   2.2  Antiferromagnetic Susceptibility
   2.3  Magnetostatic Waves
   2.4  Bulk Antiferromagnetic Polaritons
   2.5  Surface Antiferromagnetic Polaritons
   2.6  Nonreciprocity
   2.7  Damping Effects
   2.8  Surface Roughness

3  NONRECIPROCAL REFLECTION
   3.1  Possibility of nonreciprocal reflectance
   3.2  Theory
   3.3  Results

4  POLARITONS WITH DAMPING
   4.1  Surface Polaritons With Bloch-Bloembergen Damping
   4.2  Surface Polaritons With Landau-Lifshitz Damping

5  ELECTROMAGNETIC GREEN'S FUNCTIONS
   5.1  Green's Functions
   5.2  Surface Polaritons

6  SCATTERING FROM ROUGH SURFACES
   6.1  Born Approximation For Scattered Fields
   6.2  One Dimensional Grating
   6.3  Randomly Rough Surface

7  CONCLUSIONS
   7.1  Bulk Polaritons and Nonreciprocal Reflection
   7.2  Leaky Surface Modes On Antiferromagnets
   7.3  Scattering From Periodic Gratings
   7.4  Scattering From Rough Surfaces
   7.5  Future Extensions

REFERENCES

APPENDICES
   A  Fresnel Relations
   B  Green's Functions
   C  Landau-Lifshitz Damping Terms

LIST OF TABLES

Table
I   Physical Parameters For Antiferromagnets
II  Characteristics of Leaky Modes


LIST OF FIGURES

Figure
2.1   The Antiferromagnetic Lattice
2.2   Bulk And Surface Spin Waves
2.3   Dispersion Curves for H_0 = 0
2.4   Dispersion Curves for H_0 = .3 kG
2.5   Precession Modes
2.6   Symmetry Argument
2.7   Reduced Brillouin Zone For Grating Induced Scattering
3.1   Reflection Geometry
3.2   Detailed Balance Argument
3.3   Reflectance and Transmittance for Linear Polarizations
3.4   Change in Reflectance for Left Circular Polarizations
3.5   Change in Reflectance for Right Circular Polarizations
3.6   Change in Reflectance for Different φ
3.7   Change in Reflectance in Different Bulk Bands
3.8   Reflectance and Transmittance for ω/Ω = 1.006
4.1   Dispersion Curves with Damping for H_0 = 0
4.2   Brewster Angle and Leaky Mode Dispersion Curve
4.3   Decay Parameters for Leaky Modes, 1/Ωτ = .0002
4.4   Decay Parameters for Leaky Modes, 1/Ωτ = .0008
4.5   Dispersion Curves with Damping for H_0 = .3 kG
4.6   Decay Parameters for Leaky Modes with H_0 = .3 kG
4.7   Power Flows in the Material and Vacuum for
4.8   Power Flows in the Material and Vacuum for H_0 = .3 kG
4.9   Dispersion Curves with Landau Damping
5.1   Poles of gjyj
5.2   Dispersion Curve and g^
6.1   Rough Surface Geometry
6.2   Dispersion Curves with Grating Lines,
6.3   Power Flows on Grating with 1/Ωτ = .0002
6.4   Power Flows on Grating with 1/Ωτ = .0008
6.5   Dispersion Curves with Grating Lines, H_0 = .3 kG
6.6   Evanescent Power Flows for the Grating
6.7   Reflectance of the Grating
6.8   Evanescent Power Flows for a Random Surface, σ = .1
6.9   Evanescent Power Flows for a Random Surface, σ = .5
6.10  Reflectance of the Rough Surface


CHAPTER 1


INTRODUCTION

The  coupling  between  electromagnetic  waves  and  the  fundamental  excitations  in  a 
material (plasmons, phonons, etc.) produces what is known as a polariton. Polaritons in
dielectrics and metals have been extensively studied both theoretically and experimentally.
Surface polaritons in dielectrics and metals, where the amplitude of the excitation is confined
to the region near the surface, have also received a great deal of attention and are now used
as a tool in studying the vibrational spectra of very thin films.

In contrast, magnetic polaritons (coupled electromagnetic waves and spin waves)
and magnetic surface polaritons have received less attention. One reason for this is that in
ferromagnets the frequency of the magnetic polariton is typically lower (1-20 GHz for
metallic ferromagnets) than the frequency for a phonon-polariton. This means wavelengths
for ferromagnetic polaritons are in the millimeter and centimeter range and are much larger
than phonon-polariton wavelengths. It is thus difficult in ferromagnets to realistically obtain
an  effectively  infinite  or  semi-infinite  sample  size  or  to  obtain  information  about  the  region 
close  to  the  surface. 

Recently,  the  properties  of  bulk  and  surface  polaritons  on  a  semi-infinite  uniaxial 
antiferromagnet were discussed theoretically and measured by Remer et al. on MnF2 using
reflectivity measurements. These excitations possess several intriguing features. First, and
in contrast to the ferromagnet, antiferromagnetic polaritons have frequencies in the infrared
with typical values ranging from 250 GHz to a few THz. The wavelengths (and penetration
depths) thus range from millimeters to a few hundred micrometers. Second, and in contrast
to phonon-polaritons, antiferromagnetic surface polaritons are nonreciprocal, i.e.
ω(+k) ≠ ω(-k), where ω is the frequency of the polariton and k is the wavevector. For
nonreciprocal  surface  waves,  reversing  the  direction  of  propagation  or  reversing  the  direction 
of  an  applied  magnetic  field  generally  leads  to  a  surface  polariton  of  a  different  frequency. 
The  degree  of  nonreciprocity  can  be  controlled  by  varying  the  strength  of  the  applied  field 
and  disappears  when  the  applied  field  is  turned  off. 

The nonreciprocity of ferromagnetic and antiferromagnetic surface polaritons is a
primary  reason  for  studying  these  excitations.  From  a  theoretical  point  of  view,  the  existence 
or  nonexistence  of  reciprocity  is  an  interesting  problem  in  its  own  right.  The  possibility  for 
nonreciprocal  features  of  an  excitation  in  a  complex  system  can  often  be  arrived  at  by 
considering  certain  symmetries  of  the  system,  and  so  a  knowledge  of  the  conditions  under 
which  nonreciprocity  exists  helps  to  determine  the  key  interactions  which  govern  the 
excitations.  As  an  example,  an  examination  of  the  Hamiltonian  for  a  ferromagnet  gives  the 
interesting  result  that  it  is  not  exchange  interactions  but  rather  the  dipole-dipole  interactions 
and  the  presence  of  a  boundary  on  the  material  that  determine  nonreciprocity  in  ferromagnetic 
magnons.^ 

Knowledge of nonreciprocal features is clearly important for experimental
investigations.  The  measurements  by  Remer  of  surface  antiferromagnetic  polaritons,  for 
example,  relied  on  the  nonreciprocity  of  the  surface  polaritons  in  an  applied  magnetic  field  to 
provide  a  reflectance  which  was  nonreciprocal  with  respect  to  the  direction  of  the  applied 
field.  Other  examples  where  nonreciprocity  can  easily  be  exploited  are  magnetic  superlattices 
and  antiferromagnetic  thin  films.  These  systems  are  expected  to  support  nonreciprocal  spin 
waves  even  without  applied  magnetic  fields.  In  the  superlattice,  the  nonreciprocity  depends 
simply  on  whether  there  is  an  even  or  odd  number  of  layers.  In  a  similar  manner,  the 
nonreciprocity  in  a  thin  film  with  (100)  surfaces  depends  on  whether  there  is  an  even  or  odd 
number of (100) sublattice planes across the film.

Nonreciprocal  properties  also  have  a  number  of  important  technological 
applications. A list of existing devices used in microwave signal processing which
depend on nonreciprocal features includes circulators, isolators, phase shifters, delay lines,
resonators  and  signal  to  noise  enhancers.  At  microwave  frequencies,  these  devices  depend 
on  the  nonreciprocity  of  ferromagnetic  magnetostatic  modes  and  are  commonly  manufactured 
out  of  YIG  (yttrium  iron  garnet).  A  circulator  allows  the  multiple  connection  of  different 
devices  (such  as  connecting  a  receiver  and  transmitter  to  the  same  antenna)  by  using  the 
property  that  ferromagnetic  magnetostatic  waves  on  thick  films  only  propagate  in  one 
direction. An isolator uses a nonreciprocal localization of waves in a film to one surface or
the  other  in  order  to  separate  signals.  Phase  shifters  and  delay  lines  use  applied  fields  to 
control  the  group  velocity  of  a  magnetostatic  wave  propagating  on  a  ferrite  surface. 

Any nonreciprocal feature is potentially interesting from an applications point of
view. By analogy to the great many uses of nonreciprocal ferromagnetic devices in
microwave signal processing, one expects there to be a corresponding number of
applications for antiferromagnetic devices at infrared frequencies. In fact, recent proposals
for nonreciprocal magnetoplasmon based devices, to operate in the 300 to 600 GHz range,
indicate an interest in devices at high frequencies.

As  discussed  above,  surface  polaritons  on  antiferromagnets  have  already  been 
observed and their basic features catalogued and understood. In terms of discussing the
nonreciprocal properties of antiferromagnetic polaritons, however, only the nonreciprocal
frequency behavior (ω(k) ≠ ω(-k)) of the surface modes has been examined. The bulk polariton
modes, on the other hand, are reciprocal in frequency in applied magnetic fields. Their
polarization, however, depends on the magnitude and direction of the external field. This
dissertation begins with an examination of the coupling between photons and bulk
antiferromagnetic polariton modes in external magnetic fields and shows that there can be a
nonreciprocal reflectance of an incident wave such that R(θ) ≠ R(-θ), where θ is the angle
of incidence.

The  existing  studies  of  surface  polaritons  on  antiferromagnets  have  essentially 
considered  only  a  very  idealized  situation  where  the  antiferromagnetic  film  is  perfect  and 


4 


there  is  no  scattering  of  the  wave  either  due  to  imperfections  in  the  crystal  or  interactions  with 
other  modes.  The  next  step  toward  understanding  the  surface  polaritons  and  assessing  their 
potential  utility  is  to  ask  how  imperfections  affect  the  surface  polaritons. 

Two  types  of  imperfections  are  considered.  The  first  is  irregularities  intrinsic  to  the 
material  that  cause  the  surface  wave  to  lose  energy  as  it  propagates.  These  irregularities  are 
loosely  termed  "damping  mechanisms".  The  second  irregularity  to  be  considered  is  extrinsic: 
geometrical  imperfections  on  the  surface  of  the  material  that  can  scatter  the  surface  wave  as  it 
propagates.  Both  of  these  irregularities  are  important  on  two  counts:  first  because  damping 
and  surface  roughness  are  always  present  in  reality  to  some  degree  and  must  be  accounted 
for  in  either  experiment  or  application;  second,  and  perhaps  most  importantly,  damping  and 
roughness  represent  two  distinct  possible  mechanisms  for  coupling  the  polariton  fields  to 
external  photon  fields.  This  is  clearly  important  in  any  optical  measurement  or  application  of 
antiferromagnetic  materials.  With  this  in  mind,  the  consequences  of  material  damping  on  the 
antiferromagnetic surface electromagnetic excitations are examined. Next, the excitations are
studied by using surface electromagnetic response functions for the material. Finally, the
response functions are used to estimate the reflectance of a rough surfaced antiferromagnet.

A  surprising  result  of  including  damping  into  the  antiferromagnet  and  searching  for 
possible  surface  excitations  is  the  existence  of  "leaky"  surface  modes.  These  are  excitations 
that  depend  on  damping  for  their  existence  and  have  finite  lifetimes  and  path  lengths.  They 
are  localized  to  the  surface,  but  transmit  energy  into  the  bulk  of  the  material.  An  interesting 
feature  is  that  the  leaky  modes  exist  in  frequency  and  wavelength  regions  where  true  surface 
modes  cannot.  Leaky  modes  have  been  predicted  and  observed  for  non-magnetic  materials 
and  the  leaky  surface  modes  found  in  this  magnetic  system  are  analogous  to  the  Brewster 
surface  polariton  modes  found  in  dielectric  materials. 

The  outline  of  this  dissertation  is  as  follows.  Chapter  2  is  an  overview  of 
electromagnetic  excitations  in  antiferromagnets  and  gives  a  qualitative  discussion  of  the 
effects of including damping and roughness into the description of the surface polaritons. In
Chapter 3 the reflectance of light from an idealized semi-infinite antiferromagnet is calculated
and  the  nonreciprocity  of  the  reflectance  with  respect  to  the  incident  angle  is  explained.  In 
Chapter  4  the  effects  of  damping  are  explored  by  examining  the  surface  polariton  dispersion 
curves  obtained  from  Maxwell's  equations  and  the  properties  of  the  leaky  modes  are 
examined  in  detail.  The  surface  electromagnetic  response  functions,  or  Green's  functions, 
are  derived  in  Chapter  5  for  a  semi-infinite  antiferromagnet  and  applied  to  the  problem  of 
scattering  light  from  a  rough  surface  in  Chapter  6.  Finally,  Chapter  7  summarizes  the  main 
findings  of  this  work  and  comments  on  possible  future  extensions. 


CHAPTER 2


MAGNETIC EXCITATIONS AND POLARITONS

Polaritons are excitations in the electromagnetic field created by a variety of
interacting systems of particles. A gas of weakly interacting electrons, for example, is a
medium in which plasmon polaritons can exist. If a magnetic field is present, then
magnetoplasmon polaritons are possible. The electromagnetic excitations in magnetic
systems, such as ferromagnets and antiferromagnets, are known as magnon polaritons. To
understand a particular polariton excitation, it is first necessary to understand the electrical
and magnetic interactions within the system that support the excitation.

2.1  Magnetic  interactions. 

It  is  useful  to  develop  a  phenomenological  model  of  the  antiferromagnetic  crystal 
lattice  and  describe  the  properties  of  this  system  in  terms  of  macroscopic  fields.  This 
approach  is  valid  as  long  as  the  excitations  have  long  wavelengths  relative  to  the  lattice  site 
spacing. In this limit, the electric and magnetic fields are averaged over distances of several
lattice  sites  and  the  actual  fluctuating  microscopic  fields  are  replaced  by  macroscopic  effective 
fields. 

Imagine  a  simple  cubic  lattice  with  magnetic  moments  placed  at  all  the  lattice  sites. 
Suppose  further  that  the  moments  are  reversed  from  site  to  site,  forming  lattice  planes  with 
alternating  magnetizations.  There  are  then  two  sublattices  forming  the  system,  and  the 
magnetization at any point in the crystal is the sum of an average magnetization M_a due to
one sublattice and an average magnetization M_b due to the other sublattice. Such a system is
pictured in figure 2.1 and can be used as a model for the uniaxial antiferromagnet MnF2.


Figure 2.1. The antiferromagnetic crystal is represented by a periodic arrangement of
microscopic magnetic moments M_a and M_b. All the A sublattice moments point
in the same direction and all the B sublattice moments point in the opposite
direction. The cubic structure shown represents MnF2.


The magnitudes of the two magnetizations are equal in an antiferromagnet and they
point in opposite directions so that the total magnetization of the crystal is zero. Here only
uniaxial antiferromagnets are considered, where at equilibrium M_a and M_b are aligned along
one particular direction in the crystal when there are no external magnetic fields. This
direction is called the easy axis. A magnetic excitation is represented by small deviations of
the moments at each site on the sublattices away from the easy axis. The deviations are
coupled through the interacting moments and can be represented as waves which propagate
through the crystal. This is illustrated in figure 2.2 where the deviations are due to the
precession of magnetic moments about a magnetic field H_eff. The force law governing the
motion of the magnetizations is known as Bloch's equation of motion:

\[ \dot{\mathbf{M}} = \gamma\, \mathbf{M} \times \mathbf{H}_{\mathrm{eff}} \tag{2.1} \]

The gyromagnetic ratio is γ and has the value -1.805 x 10^10 rad/s/kG.
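
The precession described by (2.1) can be illustrated numerically. The following is a minimal sketch, not taken from the dissertation: it assumes a single magnetization in a fixed 1 kG field along z, uses a standard fourth-order Runge-Kutta step, and takes the exponent of the gyromagnetic ratio to be 10^10 rad/s/kG. The function names `bloch_rhs` and `precess` are hypothetical.

```python
import numpy as np

# Gyromagnetic ratio in rad/(s kG); the exponent 10 is an assumption.
GAMMA = -1.805e10


def bloch_rhs(m, h_eff, gamma=GAMMA):
    """Right-hand side of equation (2.1): dM/dt = gamma * (M x H_eff)."""
    return gamma * np.cross(m, h_eff)


def precess(m0, h_eff, dt, steps, gamma=GAMMA):
    """Integrate (2.1) for a constant field with classical RK4 steps."""
    m = np.array(m0, dtype=float)
    h = np.array(h_eff, dtype=float)
    for _ in range(steps):
        k1 = bloch_rhs(m, h, gamma)
        k2 = bloch_rhs(m + 0.5 * dt * k1, h, gamma)
        k3 = bloch_rhs(m + 0.5 * dt * k2, h, gamma)
        k4 = bloch_rhs(m + dt * k3, h, gamma)
        m = m + (dt / 6.0) * (k1 + 2.0 * k2 + 2.0 * k3 + k4)
    return m


# Illustrative values: a 0.6 kG magnetization tipped slightly off the z axis,
# precessing about a 1 kG field along z.
m_start = np.array([0.06, 0.0, 0.6])
field = np.array([0.0, 0.0, 1.0])
period = 2.0 * np.pi / (abs(GAMMA) * np.linalg.norm(field))
m_end = precess(m_start, field, period / 1000.0, 1000)
```

After one full period the tip of the magnetization returns to its starting point, and both its length and its component along the field are unchanged, which is the circular path sketched in figure 2.2.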

The  interactions  between  the  individual  magnetic  moments  are,  on  a  macroscopic 
level,  treated  as  effective  fields  acting  on  each  sublattice.  One  is  thus  able  to  define  two 
effective fields H_eff^a and H_eff^b and write two coupled equations of motion of the Bloch
form. In the limit of small deviations from the equilibrium directions, these equations can be
cast in the form M = χ h, where h is the macroscopic dipolar field, and used to find an
explicit form for the magnetic susceptibility χ. An explicit example of this calculation is
given in chapter 4 where the susceptibilities are derived when Landau damping is included
in the Bloch equations of motion.

The effective fields have several parts: first, there is a static externally applied field
H_0, which will always be aligned along the easy axis. Anisotropy fields, which act to direct
the moments along their respective sublattices, are defined by the constant H_a. The
macroscopic dipolar fields, h, are defined by B = h + 4πM. Finally, the average effect of the
exchange interaction, which couples each moment to its nearest neighbors, is represented
through a macroscopic field in the mean field limit. This limit assumes that the field
varies slowly from lattice site to lattice site where the mean field is directly proportional to the
average sublattice magnetizations. This restricts the description to frequencies where the
wavelength of the excitation spans several lattice sites. Demagnetization fields are not taken
into account for two reasons. First, the geometry used throughout this dissertation is
semi-infinite with the surface plane parallel to the easy axis. The net magnetization normal to
the easy axis is zero and so there are no demagnetization fields in directions normal to the
surface. The net magnetization is zero parallel to the surface and so there are no
demagnetization fields in these directions either.

Figure 2.2. (a) The precession of the magnetizations about the effective fields is shown by
the circles which represent the path traced out by the tip of the magnetization
vector. (b) An end view of the precessions where the effective field is normal
to the plane of view. (c) The precession angles of the magnetizations decrease
with distance into the material for waves localized to the surface.

With  these  definitions,  the  effective  field  acting  on  sublattice  A  is  given  by 

\[ \mathbf{H}_{\mathrm{eff}}^{a} = \hat{z}\,(H_0 + H_a) + \lambda \mathbf{M}_b + \mathbf{h} \tag{2.2a} \]

The quantity λM_b is the mean exchange field acting on sublattice A due to moments on
sublattice B. λ is an exchange constant that can be obtained from a microscopic exchange
Hamiltonian. Note that the easy axis coincides with the z axis in this definition.

The  effective  field  acting  on  sublattice  B  is  given  by  the  similar  expression 

\[ \mathbf{H}_{\mathrm{eff}}^{b} = \hat{z}\,(H_0 - H_a) + \lambda \mathbf{M}_a + \mathbf{h} \tag{2.2b} \]

Here, λM_a is the mean exchange field acting on sublattice B due to moments on sublattice A.
In both (2.2a) and (2.2b), the anisotropy fields are along the easy axis. These fields direct the
moments on the A sublattice in the +z direction and the moments on the B sublattice in the -z
direction. Values of these parameters are listed in Table I for the uniaxial antiferromagnets
MnF2, GdAlO3, and FeF2. These materials are well characterized and have been
studied with microwave, Brillouin scattering, and far infrared techniques.


Table I. Physical parameters of example uniaxial antiferromagnets.

                                    MnF2      GdAlO3    FeF2
Neel Temperature (K)                67        3.87      79
Exchange field H_e (kG)             550       18.8      540
Anisotropy field H_a (kG)           7.87      3.65      200
Sublattice magnetization M (kG)     .6        .624      .56
Resonance field Ω (kG)              93.37     12.26     506


2.2  Antiferromagnetic  Susceptibility. 

The magnetic susceptibility χ is derived from Bloch's equations using (2.1) and
(2.2) and assuming the time dependence of the magnetizations and the dipolar fields varies as
exp(-iωt). Small deviations in the x and y directions are assumed and only quantities of first
order in M_x, M_y and h_x, h_y are retained.

A useful quantity for Maxwell's electromagnetic equations is the permeability,
defined by μ = 1 + 4πχ. For the uniaxial antiferromagnet, this has the form
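
As a sketch of the tensor structure, the usual gyrotropic form for an easy axis along z is written below. The placement of the factors of i and the sign of the off-diagonal entries are a convention assumed here rather than one confirmed by the text; the labels μ_1 and μ_2 refer to the components given in (2.4) and (2.5).

```latex
\mu(\omega) =
\begin{pmatrix}
 \mu_1   & i\mu_2 & 0 \\
 -i\mu_2 & \mu_1  & 0 \\
 0       & 0      & 1
\end{pmatrix}
```

With this form the response is isotropic in the plane normal to the easy axis apart from the off-diagonal coupling μ_2, and the response along the easy axis is that of vacuum.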


Gaussian units are assumed throughout this dissertation. The components of the tensor are
given by:

\[ \mu_1 = 1 - \frac{4\pi\gamma^2 M H_a}{\Omega^2}\,(X + Y) \tag{2.4} \]

\[ \mu_2 = \frac{4\pi\gamma^2 M H_a}{\Omega^2}\,(Y - X) \tag{2.5} \]

Here M is the magnitude of the sublattice magnetizations and Ω is the antiferromagnetic
resonance field given by

\[ \Omega^2 = H_a\,(2H_e + H_a) \tag{2.6} \]
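
The resonance fields listed in Table I can be checked against the exchange and anisotropy fields in the same table. The short sketch below assumes the relation Ω² = H_a(2H_e + H_a), with all fields in kG; the dictionary layout and the function name `resonance_field` are illustrative only.

```python
import math

# Table I values in kG: (exchange field H_e, anisotropy field H_a, tabulated Omega)
TABLE_I = {
    "MnF2": (550.0, 7.87, 93.37),
    "GdAlO3": (18.8, 3.65, 12.26),
    "FeF2": (540.0, 200.0, 506.0),
}


def resonance_field(h_e, h_a):
    """Resonance field in kG, assuming Omega^2 = H_a * (2 * H_e + H_a)."""
    return math.sqrt(h_a * (2.0 * h_e + h_a))


# Compare the computed resonance field with the tabulated one for each material.
computed = {name: resonance_field(h_e, h_a) for name, (h_e, h_a, _) in TABLE_I.items()}
differences = {name: abs(computed[name] - TABLE_I[name][2]) for name in TABLE_I}
```

For all three materials the computed value agrees with the tabulated resonance field to within the rounding of the table entries.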


The quantities X and Y are defined as

\[ X = \left[ \left( \omega/\Omega + H_0\gamma/\Omega + i/\Omega\tau \right)^2 - 1 \right]^{-1} \tag{2.7} \]

and

\[ Y = \left[ \left( \omega/\Omega - H_0\gamma/\Omega + i/\Omega\tau \right)^2 - 1 \right]^{-1} \tag{2.8} \]

τ is a phenomenological spin relaxation time and is included as the simplest possible
representation of a loss mechanism. It originates from the inclusion of the
Bloch-Bloembergen damping term into the right hand side of the equations of motion
(2.1).^14
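
A small numerical sketch may help show how the applied field and the damping enter the permeability components. It is written under several assumptions that are not confirmed by the text: Ω is converted to a frequency by multiplying the Table I resonance field by |γ|, γ is taken as -1.805 x 10^10 rad/s/kG, Y differs from X only in the sign of the applied-field term, and the overall sign of μ_2 follows the convention μ_2 ∝ (Y - X). The function names are hypothetical.

```python
import math

GAMMA = -1.805e10  # rad/(s kG); the exponent is an assumption


def xy_factors(omega, h0, res_freq, tau, gamma=GAMMA):
    """X and Y with omega and res_freq in rad/s, h0 in kG, tau in s."""
    damping = 1j / (res_freq * tau)
    shift = gamma * h0 / res_freq
    x = 1.0 / ((omega / res_freq + shift + damping) ** 2 - 1.0)
    y = 1.0 / ((omega / res_freq - shift + damping) ** 2 - 1.0)
    return x, y


def permeability(omega, h0, m, h_a, res_freq, tau, gamma=GAMMA):
    """Return (mu_1, mu_2); the sign convention for mu_2 is assumed."""
    x, y = xy_factors(omega, h0, res_freq, tau, gamma)
    prefactor = 4.0 * math.pi * gamma ** 2 * m * h_a / res_freq ** 2
    return 1.0 - prefactor * (x + y), prefactor * (y - x)


# MnF2 values from Table I (kG), with 1/(Omega * tau) = .0002.
M_SUB, H_A, RES_FIELD = 0.6, 7.87, 93.37
RES_FREQ = abs(GAMMA) * RES_FIELD       # resonance expressed as a frequency
TAU = 1.0 / (0.0002 * RES_FREQ)
OMEGA = 1.01 * RES_FREQ                 # probe just above resonance

mu1_zero, mu2_zero = permeability(OMEGA, 0.0, M_SUB, H_A, RES_FREQ, TAU)
mu1_field, mu2_field = permeability(OMEGA, 0.3, M_SUB, H_A, RES_FREQ, TAU)
```

With no applied field X and Y coincide, so the off-diagonal component vanishes; a field of .3 kG makes it finite. The finite relaxation time gives μ_1 an imaginary part, which is the dissipation referred to in the text.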

The magnetic behavior of the antiferromagnet is described with the frequency
dependent μ. The material is dissipative when τ is finite and has a resonance at frequency
Ω. At resonance, the moments on each sublattice precess about the easy axis uniformly in
phase with one another.

2.3  Magnetostatic  Waves. 

When the phase of the precessing moments on each sublattice varies in a periodic
manner, as in figure 2.2, the excitations are called spin waves. When the wavelengths of these
excitations are very long, they exist at frequencies that satisfy the condition k >> ω/c. Then
∇ × H = 0 and the waves are independent of any electric fields. These excitations are called
magnetostatic spin waves. In planar ferromagnetic structures they are often called
Damon-Eshbach modes and in spherical ferromagnetic structures, they are known as Walker modes.

There  are  many  intriguing  and  useful  properties  of  these  waves.  In  both 
ferromagnets  and  antiferromagnets,  the  magnetostatic  spin  waves  exist  only  in  certain 
frequency  regions,  or  bands.  In  very  thick  films  the  bulk  modes  can  propagate  at  the 
frequencies

$$\omega_1 = \gamma\left[H_a(2H_e + H_a)\right]^{1/2}$$

and

$$\omega_2 = \gamma\left[H_a(2H_e + H_a + 8\pi M)\right]^{1/2}$$

Note that $\omega_1$ is also the antiferromagnetic resonance frequency.

In thin film structures, the bulk modes exist over a range of frequencies, or bands,
but only certain wavevectors are allowed at any frequency within a bulk band. Also, applied
fields drastically affect the allowed frequencies and wavevectors of the modes. In
antiferromagnets, new frequency bands appear when a field is applied.

A  particularly  intriguing  feature  is  the  existence  of  surface  magnetostatic  waves;  i.e., 
waves  which  are  localized  to  the  surface  of  a  structure.  These  waves  propagate  along  the 
surface,  decaying  exponentially  away  in  directions  normal  to  the  surface.  In  the 
magnetostatic  region,  the  surface  modes  exist  in  frequency  regions  forbidden  to  bulk  waves. 
In very thick films, surface modes lie between the bulk bands:

$$\omega_s = \gamma\left[H_a(2H_e + H_a + 4\pi M)\right]^{1/2}$$
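
As a quick numerical check of the ordering of these three magnetostatic frequencies, the sketch below evaluates them in units of $\gamma$, so that each frequency is expressed as an equivalent field. The field values passed in (exchange field, anisotropy field and sublattice magnetization, in kG) are assumed, illustrative numbers of the order usually quoted for MnF2; they are not taken from this dissertation.

```python
import math


def magnetostatic_frequencies(H_e, H_a, M):
    """Return (w1, ws, w2) divided by gamma, i.e. in field units."""
    w1 = math.sqrt(H_a * (2 * H_e + H_a))                    # lower bulk limit
    ws = math.sqrt(H_a * (2 * H_e + H_a + 4 * math.pi * M))  # surface mode
    w2 = math.sqrt(H_a * (2 * H_e + H_a + 8 * math.pi * M))  # upper bulk limit
    return w1, ws, w2


# Assumed, illustrative parameters in kG (hypothetical inputs).
w1, ws, w2 = magnetostatic_frequencies(H_e=550.0, H_a=7.87, M=0.6)
```

For any positive M the surface frequency falls strictly between the two bulk frequencies, which is the statement made in the text.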

Surface  modes  are  also  strongly  affected  by  applied  magnetic  fields.  The  presence 
of  external  fields  leads  to  antiferromagnetic  magnetostatic  surface  spin  waves  that  have 
different  frequencies  for  opposite  propagation  directions.  This  is  an  example  of 
nonreciprocity  in  frequency  with  respect  to  propagation  direction,  a  property  discussed  more 
fully  in  connection  with  antiferromagnetic  polaritons  in  chapter  3. 




2.4  Bulk  Antiferromagnetic  Polaritons. 

Electromagnetic excitations also occur at wavelengths where $k \approx \omega/c$ and the
fluctuating magnetic fields have associated electric fields. These electromagnetic excitations
are polaritons and exist at frequencies near those of magnetostatic waves but have much
longer wavelengths. Just as photons are the excitations of the electromagnetic field in
vacuum, these polaritons are excitations of the electromagnetic field in the material. The
behavior of these excitations is governed by the magnetic susceptibilities (2.8) and also by
the dielectric susceptibilities. The constitutive relations $B = \mu H$ and $D = \epsilon E$ are used, with fields that
obey the retarded Maxwell equations:

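$$\nabla \cdot D = 0$$

$$\nabla \cdot B = 0$$

$$\nabla \times E + \frac{1}{c}\frac{\partial B}{\partial t} = 0$$

$$\nabla \times H - \frac{1}{c}\frac{\partial D}{\partial t} = 0$$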

Materials which are anisotropic in their dielectric properties will be considered here.
The dielectric susceptibilities are written in tensorial form as

$$\epsilon = \begin{pmatrix} \epsilon_1 & 0 & 0 \\ 0 & \epsilon_1 & 0 \\ 0 & 0 & \epsilon_2 \end{pmatrix} \qquad (2.9)$$


The dielectric susceptibilities are assumed to be constant over the frequency ranges of interest
here, although it is a simple matter to substitute the appropriate frequency dependent
functions for $\epsilon_1$ and $\epsilon_2$.




It is a straightforward exercise to write the electromagnetic wave equation from the
Maxwell relations. In vector form, it is given by

$$\nabla \times \epsilon^{-1}\, \nabla \times H - \omega_0^2\, \mu H = 0 \qquad (2.10)$$

Here and throughout the rest of this dissertation, $\omega_0 = \omega/c$. Note that the magnetic fields $H$
are assumed to have the time dependence $e^{-i\omega t}$. In the less obvious, but computationally
more efficient form, this is equivalent to


[Equation (2.11): component form of the wave equation (2.10), written as a primed sum over the indices m and j.]

(2.11)

The  prime  on  the  sum  means  that  Also,  since  the  dielectric  tensor  is  diagonal. 

For the present, assume the material extends infinitely in all directions. The
solutions to this equation then represent bulk antiferromagnetic polaritons in an infinite or
semi-infinite medium. Plane wave solutions are assumed with the form $e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)}$, where k is
the wavevector of the excitation and $\omega$ is the frequency.

Using plane wave solutions for H, an expression for $k(\omega)$ can be obtained from the
wave equation (2.11). For arbitrary directions of propagation, $k(\omega)$ is rather complicated.
For the special case of propagation in a direction perpendicular to the easy axis, the
dispersion relation is particularly simple and has the form


(2.12) 

Here $k_x$ and $k_y$ are the x and y components of the wavevector k, respectively.
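
To illustrate how a dispersion relation of this kind separates bulk bands from gaps, the sketch below uses the standard Voigt-geometry form $k_x^2 + k_y^2 = \epsilon\,\omega_0^2\,(\mu_1^2 - \mu_2^2)/\mu_1$. This form is an assumption made for the example: equation (2.12) itself is not reproduced here. The permeability components follow the forms assumed for (2.4) to (2.8) without damping, and the material numbers (`EPS`, `STRENGTH`) are hypothetical, illustrative values. All quantities are in the reduced units $\omega/\Omega$ and $ck/\Omega$.

```python
import math

# Hypothetical, illustrative material constants (not from the dissertation).
STRENGTH = 4 * math.pi * 0.6 / (2 * 550.0 + 7.87)  # 4*pi*gamma^2*M*H_a / Omega^2
EPS = 5.5                                          # dielectric constant


def mu_components(w, h0=0.0):
    """Lossless mu1, mu2 in reduced units (w = omega/Omega, h0 = gamma*H0/Omega)."""
    X = 1.0 / ((w + h0) ** 2 - 1.0)
    Y = 1.0 / ((w - h0) ** 2 - 1.0)
    return 1.0 - STRENGTH * (X + Y), STRENGTH * (Y - X)


def ky_squared(w, kx, h0=0.0):
    """ky^2 from the assumed Voigt-geometry dispersion relation."""
    mu1, mu2 = mu_components(w, h0)
    mu_voigt = (mu1 ** 2 - mu2 ** 2) / mu1
    return EPS * w ** 2 * mu_voigt - kx ** 2


def in_bulk_band(w, kx, h0=0.0):
    """A point (w, kx) lies in a bulk band when ky is real."""
    return ky_squared(w, kx, h0) > 0.0
```

With these assumed numbers and no applied field, `in_bulk_band(0.5, 0.0)` and `in_bulk_band(1.05, 0.0)` are true while `in_bulk_band(1.003, 0.0)` is false, reproducing the picture of two bulk bands separated by a narrow gap just above the resonance.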




First consider the case where there is no applied field. Then $\mu_2$ is identically zero
and the frequency dependence of the wavevector is determined by $\mu_1$. In frequency regions
where $\mu_1$ is positive, k is real and the polaritons are travelling waves. In regions where $\mu_1$
is negative, k is imaginary and the polaritons decay exponentially.

Since $\mu_1$ has a pole at the antiferromagnetic resonance frequency, it becomes
infinitely large as $\omega$ approaches the resonance frequency and also changes sign. Likewise, k
becomes infinite at this frequency. For $\mu_1 > 0$, k is real but for $\mu_1 < 0$, k is imaginary. The
frequency regions where k is real are the bulk polariton bands. It is only in these regions that
the antiferromagnet is transparent to electromagnetic radiation.

In figure 2.3, these bulk bands are depicted in the dispersion curve $\omega$ vs. $k_x$. The
quantities shown are in reduced units, $\omega/\Omega$ and $ck_x/\Omega$, a convention used throughout this
dissertation. The material is MnF2, there is no applied field and $\omega/\Omega = 1$ is the resonance
frequency. The bulk bands are represented by the shaded areas and represent regions where
$k_y$ (as a function of $\omega$ and $k_x$) is real. There are two bulk bands, one bounded above by the
resonance frequency and extending indefinitely to lower frequencies, and one bounded
below and extending indefinitely to higher frequencies.

In an applied field, $\mu_2$ is no longer zero and a third bulk band appears. This is
depicted in figure 2.4 where a field of 0.3 kG is applied in the z direction. The top of
the middle bulk band corresponds to the pole of $1/\mu_1$ which occurs at $\omega_2$, the high frequency
magnetostatic bulk mode described in section 2.3. In zero applied field, this frequency
represents the bottom limit of the high frequency bulk polariton band shown in figure 2.3.

In  both  figures  2.3  and  2.4,  ky  is  imaginary  outside  the  bulk  bands. 

Some insight into the nature of the bulk modes can be had by examining the
resonance motion of the magnetizations $M_a$ and $M_b$. In zero applied field, these two vectors
can precess about the easy axis in two different ways. In one case, the motion is



Figure 2.3. Dispersion curves for antiferromagnetic polaritons, in MnF2, with no applied
field. The shaded areas are bulk bands and the solid lines rising out of the bulk
bands are the surface modes when there is no damping present.




Figure 2.4. Dispersion curves for antiferromagnetic polaritons, in MnF2, with an applied
field of 0.3 kG. The shaded areas are bulk bands and the solid lines rising out of
the bulk bands are the surface modes when there is no damping present in the
material.




Figure 2.5. Here the precession of the two sublattice magnetizations is compared in zero
applied field. Each magnetization precesses about the effective field acting on its
sublattice. In zero applied field, there are two degenerate possible rotations:
one in which the A sublattice magnetization precesses with a larger angle than
the B sublattice magnetization and one in which the B sublattice magnetization
precesses with a larger angle than the A sublattice magnetization. The two
motions differ in their sense of rotation. In an applied field the two rotations
still have opposite directions but are no longer degenerate.




clockwise and the precession angle for $M_a$ is larger than that of $M_b$. In the other case, the
motion is counter clockwise and the precession angle for $M_a$ is smaller than that of $M_b$. These
two motions are depicted in figure 2.5.

In  zero  applied  field,  the  energies  of  these  two  precessions  are  degenerate.  With  an 
applied  field,  however,  the  two  motions  are  no  longer  equivalent  and  have  different 
frequencies.  Since  the  bulk  polaritons  are  coupled  to  the  precession  of  the  magnetizations, 
they should somehow also be affected by the applied field. It turns out that the polarization
of the bulk polaritons is controlled by the applied field and this will be discussed in detail in
the next chapter.

2.5  Surface  Antiferromagnetic  Polaritons. 

When the structure is finite so that an interface exists between the antiferromagnet
and some dissimilar medium, a new form of polariton can exist that is localized to the interface.
These are called surface polaritons and differ from the bulk polaritons in many significant
respects. Instead of a continuum of frequencies at a given wavelength, as in bulk modes,
surface polaritons typically exist at only a finite number of frequencies. They may also
display a high degree of nonreciprocity with respect to frequency; i.e., $\omega(k) \neq \omega(-k)$.

Surface polaritons are characterized by an exponential decrease in amplitude away
from the interface. In this dissertation, this interface will always be between the
antiferromagnet and vacuum. If the normal to the surface is in the y direction, as shown in
figure 2.6, exponential decay corresponds to an imaginary $k_y$ both in the antiferromagnet and
outside it. Inside the material, $k_y$ is imaginary when $\mu_1$ is negative. This occurs for
frequencies just above the resonance pole. Outside the material in vacuum, $k_y$ is imaginary
only for frequencies and wavevectors away from the light line $\omega/k = c$ such that $\omega/k < c$.
This means surface polaritons exist outside frequency and wavelength regions accessible to
bulk polaritons on either side of the interface.




A dispersion relation for the surface polaritons can be obtained by imposing
boundary conditions at the surface. For the electromagnetic waves considered here, these
boundary conditions are the usual continuity conditions on tangential H and E fields and
normal D and B fields. The procedure is to apply the boundary conditions to the bulk
solutions of the wave equation (2.11) and the bulk solutions of the wave equation
appropriate to free space across the surface. The solutions in both regions are required to
decay exponentially away from the surface and the boundary conditions result in a
characteristic equation that can be solved for the relationship between $\omega$ and k.

For surface waves travelling in the x direction according to wavevector $k_x$, the
solutions inside and outside the material (y>0 and y<0, respectively) thus have the form

$$A\, e^{i k_x x - \alpha y}, \quad y > 0 \qquad (2.13)$$

$$B\, e^{i k_x x + \gamma y}, \quad y < 0 \qquad (2.14)$$

where $\alpha$ is the decay parameter inside the material (always assumed positive) and is given by
(2.12) with $k_y = i\alpha$. Outside the material, $\gamma$ is the decay parameter (also always assumed
positive) and is given by $[k_x^2 - \omega_0^2]^{1/2}$. Continuity of tangential H and normal B leads to the
implicit dispersion relation

$$\gamma + (\mu_1 \alpha + \mu_2 k_x)/(\mu_1^2 - \mu_2^2) = 0 \qquad (2.15)$$
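
The structure of the implicit relation (2.15) can be explored numerically by evaluating its left hand side as a residual. The sketch below is only an illustration: it assumes that (2.12) has the standard Voigt-geometry form, so that $\alpha^2 = k_x^2 - \epsilon\,\omega_0^2(\mu_1^2 - \mu_2^2)/\mu_1$, uses lossless permeabilities of the form assumed for (2.4) to (2.8), and takes hypothetical material constants. Reduced units $\omega/\Omega$ and $ck/\Omega$ are used, and the function names are invented for the example.

```python
import math

# Hypothetical, illustrative material constants (not from the dissertation).
STRENGTH = 4 * math.pi * 0.6 / (2 * 550.0 + 7.87)
EPS = 5.5


def mu_components(w, h0=0.0):
    """Lossless mu1, mu2 in reduced units."""
    X = 1.0 / ((w + h0) ** 2 - 1.0)
    Y = 1.0 / ((w - h0) ** 2 - 1.0)
    return 1.0 - STRENGTH * (X + Y), STRENGTH * (Y - X)


def surface_residual(w, kx, h0=0.0):
    """Left hand side of (2.15); a surface mode corresponds to a zero.

    Raises ValueError if either decay parameter would be imaginary, i.e. if
    the point (w, kx) does not describe a wave bound to the surface.
    """
    mu1, mu2 = mu_components(w, h0)
    mu_voigt = (mu1 ** 2 - mu2 ** 2) / mu1
    alpha = math.sqrt(kx ** 2 - EPS * w ** 2 * mu_voigt)  # decay inside (assumed form)
    gamma = math.sqrt(kx ** 2 - w ** 2)                   # decay in vacuum
    return gamma + (mu1 * alpha + mu2 * kx) / (mu1 ** 2 - mu2 ** 2)
```

In zero field the residual is even in `kx`, so the $+k_x$ and $-k_x$ branches coincide. With `h0` nonzero the term $\mu_2 k_x$ changes sign with the propagation direction, and the residuals for `kx` and `-kx` differ, which is the nonreciprocity described in the text. A root can be bracketed by a sign change, for example between `kx = 3.5` and `kx = 4.2` at `w = 1.003` with these assumed numbers.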

The solutions to this equation are shown in figure 2.3 as the solid lines rising out of
the lower bulk band; they asymptotically approach the magnetostatic frequency $\omega_s$ as $\omega/k$
tends to zero. The existence of two surface modes, one for $+k_x$ and one for $-k_x$




(corresponding to propagation in the + and - x directions), is characteristic of
antiferromagnetic surface excitations. Ferromagnetic magnetostatic surface modes, for
example, have only one branch. Note that both branches of the antiferromagnetic surface
polariton dispersion curve begin at $\omega = ck$, the light line.

In an applied field, the character of the two modes changes radically. In figure 2.4
the solid lines again represent surface polariton modes, but this time the $+k_x$ and $-k_x$
branches have quite different shapes and exhibit the nonreciprocity of the surface modes in
applied fields. Both of these branches begin at the light line and approach the magnetostatic
frequencies. Also, there is an interesting new surface polariton branch above the middle bulk
band which begins at the light line and approaches the outer edge of the upper bulk band.

2.6  Nonreciprocity. 

The  nonreciprocity  of  frequency  with  respect  to  wavevector,  as  in  figure  2.4,  is  one 
of  the  most  striking  features  of  the  antiferromagnetic  surface  polaritons.  That  it  should  exist 
is  somewhat  surprising  since  short  wavelength  spin  waves  do  not  exhibit  this 
nonreciprocity. The origin of the nonreciprocity, though not completely understood, is a
phenomenon of the long wavelength excitations and is due to the dipolar fields and the
presence of a surface.

It is possible to construct arguments that show the possibility of nonreciprocal
behavior without requiring its existence. One such argument, originally due to Scott and
Mills, is based on simple symmetry considerations. Consider first a material extending
infinitely in all directions. A magnetic field is applied in the +z direction and a wave
propagates in the $+k_x$ direction. Now look for some set of symmetry operations that take $k_x$
to $-k_x$ while leaving the material unchanged with respect to the applied field.




A reflection in the yz plane changes $k_x$ to $-k_x$ and, since the magnetic field is an axial
vector, it also changes to point along the -z direction. Another reflection, this time in the xz
plane, returns the field to its original orientation but leaves $k_x$ pointing in the -x direction.
These operations have reversed the wavevector and have returned the system to its original
state. The equality $\omega(k_x) = \omega(-k_x)$ must hold for the bulk polaritons. A similar argument
shows that $\omega(k_y) = \omega(-k_y)$.

Now suppose a semi-infinite material occupies the region y>0 as in figure 2.6. Here
the same sequence of operations fails to return the material to its original orientation by leaving
the material in the upper half space. It is, in fact, impossible to take $k_x$ to $-k_x$ and leave the
material in place. There is thus no requirement that $\omega(k_x) = \omega(-k_x)$ for excitations propagating
in the semi-infinite geometry. This argument shows the possibility for nonreciprocal
behavior of a surface wave parallel to the surface, but it does not demand it. Nonreciprocity
needs to be investigated case by case.
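
The bookkeeping in this symmetry argument can be followed with a few lines of code. The sketch below tracks three objects through the two mirror operations: the wavevector (a polar vector), the applied field (an axial vector, which picks up an extra sign under a mirror operation), and a representative point inside the material. The function and variable names are illustrative choices for this example.

```python
def reflect_polar(v, axis):
    """Mirror a polar vector through the plane whose normal is the given axis."""
    return tuple(-c if i == axis else c for i, c in enumerate(v))


def reflect_axial(v, axis):
    """Mirror an axial vector: same as the polar rule with an extra overall sign."""
    return tuple(-c for c in reflect_polar(v, axis))


X_AXIS, Y_AXIS = 0, 1

k = (1, 0, 0)       # wavevector along +x
H0 = (0, 0, 1)      # applied field along +z
inside = (0, 1, 0)  # a point inside the material, which fills y > 0

# Reflect in the yz plane (normal along x), then in the xz plane (normal along y).
for axis in (X_AXIS, Y_AXIS):
    k = reflect_polar(k, axis)
    H0 = reflect_axial(H0, axis)
    inside = reflect_polar(inside, axis)
```

After both reflections `k` is reversed and `H0` is back along +z, so an unbounded medium is mapped onto itself with the wavevector reversed. The point `inside` has moved from y>0 to y<0, which is why the same operations do not map the semi-infinite sample onto itself.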

Nonreciprocity of frequency with respect to wavevector is only one possibility for
nonreciprocal behavior. Another possibility for nonreciprocal behavior is in the reflection of
light from magnetic materials at frequencies where the material is transparent. In this
phenomenon, the reflectance due to a wave incident at some angle of incidence $\theta$ would not
equal the reflectance due to a wave incident at $-\theta$. Any nonreciprocal reflectance would be
due to the energy transmitted into the material. In this case it is the nonreciprocal behavior of the
bulk polaritons which is of interest.

The above symmetry arguments ignore the polarization of the bulk polariton. In the
next chapter the reflectance of an antiferromagnet in an applied field is examined and it is
found that while the energies of the bulk polaritons are reciprocal with respect to propagation
direction, the polarizations of the $+k_x$ and $-k_x$ modes can lead to a transmittance for an
incident wave with $+k_x$ that differs from the transmittance for an incident wave with $-k_x$.




Figure 2.6. In (a) the initial configuration of the semi-infinite antiferromagnet is defined with
the wavevector k of the excitation in the x direction, the material in the y>0 half
space, and the applied field directed into the paper. In (b), the system has
been reflected through the yz plane, taking k to -k and $H_0$ to $-H_0$. In (c) the
system has been reflected through the zx plane, taking $H_0$ back to its initial
direction into the paper and leaving k in the -x direction. These operations,
however, fail to return the crystal to its original state.




2.7  Damping  effects. 

Up to now, only bulk and surface polaritons in non-dissipative media ($\tau = \infty$ in the
equations of motion (2.7 and 2.8)) have been considered. This has been appropriate for
enumerating the properties of these excitations as true eigenmodes of the magnetic system
and has simplified the analysis by allowing $k_y$ to be either pure real or pure imaginary.

By ignoring dissipation, a number of important physical processes are not taken into
account. These are primarily scattering events between magnons and magnons, magnons
and phonons, etc., and can be thought of in a gross way as mechanisms of transferring
energy out of one spin wave mode into other spin wave modes and other excitations in the
crystal lattice. The simplest way to represent these energy losses is by introducing a finite
lifetime $\tau$ into the equations of motion. The advantage of this representation is that it is
mathematically simple to deal with. It is limited, however, by describing only relaxation
processes that align the precessing dipoles along their equilibrium directions. This accounts
for interactions between the spin system and the crystal lattice, but does not include
interactions between spins that would tend to align the dipoles to local fields. A slightly
more detailed description will be presented in chapter 3 when a damping term is introduced
into the equations of motion that describes the relaxation of the spins to the direction of the
instantaneous effective fields.

A finite $\tau$ leads to complex susceptibilities. The real part of the susceptibilities
governs electromagnetic wave propagation in the material and the imaginary part determines
the absorption of electromagnetic energy by the material. For the bulk and surface polaritons
described above, the complex susceptibilities result in complex wavevectors. These complex
wavevectors have a simple interpretation: the real part corresponds to propagating wave-like
excitations, and the imaginary part represents excitations that decay exponentially as they
travel.
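
A small numerical sketch shows how a finite relaxation time turns a purely real or purely imaginary wavevector into a complex one. It assumes the zero-field limit $k^2 = \epsilon\,\omega_0^2\,\mu_1$ of the bulk dispersion relation for propagation perpendicular to the easy axis, which is an assumption of this example rather than a formula quoted from the text, together with the form of $\mu_1$ assumed for (2.4) and (2.7). The material constants are hypothetical, and reduced units $\omega/\Omega$ and $ck/\Omega$ are used.

```python
import cmath
import math

# Hypothetical, illustrative material constants (not from the dissertation).
STRENGTH = 4 * math.pi * 0.6 / (2 * 550.0 + 7.87)
EPS = 5.5


def mu1_zero_field(w, inv_tau):
    """mu1 with no applied field; inv_tau = 1/(Omega*tau) sets the damping."""
    X = 1.0 / ((w + 1j * inv_tau) ** 2 - 1.0)
    return 1.0 - 2.0 * STRENGTH * X


def bulk_wavevector(w, inv_tau=0.0):
    """Complex c*k/Omega from the assumed relation k^2 = eps * w^2 * mu1."""
    return w * cmath.sqrt(EPS * mu1_zero_field(w, inv_tau))


k_lossless = bulk_wavevector(0.9)             # inside the lower bulk band
k_damped = bulk_wavevector(0.9, inv_tau=1e-3)
k_gap = bulk_wavevector(1.003)                # inside the gap, no damping
```

Without damping the wavevector is purely real inside the band (`k_lossless`) and purely imaginary in the gap (`k_gap`). With damping (`k_damped`) it acquires a small positive imaginary part, so the wave propagates and is attenuated at the same time.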




One  consequence  of  material  damping  is  a  slight  modification  of  the  bulk  and  surface 
polariton  frequencies.  The  surface  mode  frequencies  shown  in  figure  2.3,  for  example,  are 
shifted  slightly  up  and  the  bulk  mode  boundaries  are  also  affected  to  a  small  degree.  These 
effects are due to the shifting of the resonance pole in the susceptibilities.

By  far  the  most  drastic  effect  of  damping  is  the  existence  of  new  polariton  modes  in 
frequency regions otherwise forbidden to polariton excitations. This phenomenon has not
been studied in antiferromagnetic systems before, but is very well known from
plasmon-polariton  studies.  These  modes  are  surface  modes  in  the  sense  that  they  are  bound 
to  the  surface  of  the  material,  but  they  can  also  have  components  that  travel  into  the  bulk  of 
the  material.  These  components  are  attenuated  as  they  move  away  from  the  surface  due  to 
dissipation  into  the  crystal.  For  this  reason,  they  are  also  often  referred  to  as  "leaky  modes" 
since  they  leak  electromagnetic  energy  travelling  along  the  surface  into  the  interior  of  the 
crystal  where  it  is  eventually  absorbed  through  the  various  mechanisms  discussed  above. 

There  is  a  wealth  of  terminology  describing  the  leaky  modes  that  exist  on  the 
interface  between  two  dielectric  media.  The  surface  polariton  mode,  which  exists  even  in  the 
absence  of  damping,  is  termed  the  Fano  mode.  Modes  which  depend  on  damping  for  their 
existence  fall  into  three  classes:  Evanescent,  Brewster,  and  Zenneck  modes.  Evanescent 
modes  are  characterized  as  rapidly  attenuating  as  they  propagate,  with  a  path  length  on  the 
order  of  their  wavelength.  They  are  thus  difficult  to  observe.  They  also  are  unique  in  that 
they exist in frequency regions forbidden to both surface and bulk polaritons. Zenneck
modes exist in frequency regions where the dielectric function is almost completely
imaginary.

The properties of leaky modes in dielectric materials are listed in Table II in terms of
the relative magnitudes of the real and imaginary parts of the dielectric function and the
wavevector parallel and perpendicular to the surface. The real parts of the wavevector
describe the propagation of the wave and the imaginary parts determine the attenuation of the
wave. For example, the imaginary part of the surface polariton's parallel wavevector




component is much smaller than the real part. Surface polaritons thus have very long
pathlengths. The perpendicular component, however, has an imaginary part that is much
larger than the real part. Thus surface polaritons are tightly bound to the surface of the
material. By comparison, the Brewster modes also have long pathlengths but are weakly
bound to the surface of the material.

Some of the leaky surface modes found in the magnetic system are analogous to the
Brewster surface polariton modes found in dielectric materials. As is well known, for a
wave incident on a non-absorbing dielectric with the E field polarized in the plane of incidence,
there exists an angle of incidence for which there is no reflected wave. Since the Brewster
angle depends on the frequency $\omega$, it provides a relationship between $\omega$ and $k_x$, the
component of the wavevector parallel to the surface. With damping present in the dielectric,
one can find a solution of Maxwell's equations for a weakly bound surface wave which
decays as it propagates parallel to the surface. This surface mode has a dispersion relation $\omega$
as a function of $\mathrm{Re}(k_x)$ which is very similar to the relationship in the Brewster case and is
known as a Brewster mode. In the antiferromagnetic system, an equivalent result can be
found. Here a Brewster angle occurs for a geometry where the magnetic field lies in the
plane of incidence. Again when damping is present, a magnetic Brewster surface mode can
exist with a dispersion relation similar to that provided by the relationship of the magnetic
Brewster's angle and the frequency.
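
The dielectric Brewster condition mentioned above can be checked with the textbook Fresnel formula. The sketch below assumes light incident from vacuum on a non-absorbing, isotropic dielectric with real $\epsilon > 1$ and the E field in the plane of incidence; for that case $\tan\theta_B = \sqrt{\epsilon}$, so that $k_x = (\omega/c)\sqrt{\epsilon/(1+\epsilon)}$. The function names are illustrative, and the magnetic analogue discussed in the text is not modelled here.

```python
import math


def brewster_angle(eps):
    """Brewster angle (radians) for vacuum to dielectric, E in the plane of incidence."""
    return math.atan(math.sqrt(eps))


def reflectance_p(theta, eps):
    """Fresnel reflectance for E in the plane of incidence (real eps > 1)."""
    cos_t = math.cos(theta)
    root = math.sqrt(eps - math.sin(theta) ** 2)
    r = (eps * cos_t - root) / (eps * cos_t + root)
    return r * r


def brewster_kx(w_over_c, eps):
    """Parallel wavevector at the Brewster angle: (omega/c) * sin(theta_B)."""
    return w_over_c * math.sqrt(eps / (1.0 + eps))
```

If $\epsilon$ depends on frequency, evaluating `brewster_kx` at each frequency traces out the curve of $\omega$ against $k_x$ that the Brewster mode dispersion resembles.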

In chapter 4, a detailed discussion of the antiferromagnetic leaky modes and their
properties is presented. Since these modes depend on damping for their existence, it is
natural to question the form of the damping terms included in the equations of motion. In
equations (2.7) and (2.8), only the simplest damping terms, Bloch-Bloembergen, are
represented. This kind of damping describes a relaxation of the precessing magnetizations to
their equilibrium values.


Table II. Properties of leaky waves in dielectrics. The dielectric function is ε and the
wavevector parallel to the surface is k_x. The wavevector component normal to the
surface is denoted by q, defined separately inside the material and in the vacuum.

Type                        ε                                   k_x                  q                 Phase velocity

"True" Surface Polariton    Re(ε) < -1,      Im(ε) << |Re(ε)|   Re(k_x) >> Im(k_x)   Re(q) << Im(q)    < c
Evanescent                  -1 < Re(ε) < 0,  Im(ε) >> |Re(ε)|   Re(k_x) ≈ Im(k_x)    Re(q) >> Im(q)
Brewster                    Re(ε) > 0,       Im(ε) << Re(ε)     Re(k_x) >> Im(k_x)   Re(q) >> Im(q)    > c
Zenneck                     Re(ε) << -1,     Im(ε) >> |Re(ε)|   Re(k_x) >> Im(k_x)   Re(q) >> Im(q)    > c




Alternatively,  one  can  describe  a  relaxation  to  the  instantaneous  value  of  the 
effective  fields.  The  required  expressions  can  be  derived  from  the  free  energy  and  are  called 
Landau-Lifshitz  damping  terms.  These  terms  are  calculated  and  included  in  the 
susceptibilities in Appendix I. Their effect on the antiferromagnetic leaky modes is explored
in  chapter  4. 

2.8  Surface  Roughness. 

Having  examined  the  nature  of  magnetic  excitations  in  the  antiferromagnet, 
described  the  coupled  photon  and  magnon  modes  that  exist  in  the  electromagnetic  field  of  the 
crystal,  and  considered  the  effects  of  damping  mechanisms  on  the  surface  electromagnetic 
excitations,  this  survey  of  surface  and  bulk  polaritons  is  nearly  complete.  Throughout  this 
investigation,  the  focus  has  been  on  quantities  important  for  the  experimental  observation  and 
potential  utility  of  these  excitations.  Continuing  in  this  manner,  the  following  discussion 
considers  the  effect  of  surface  roughness  on  polariton  excitations  and  the  possibility  of 
coupling  light  to  these  excitations  through  periodic  gratings  ruled  on  the  antiferromagnetic 
surface. 

Surface  roughness  provides  two  mechanisms  by  which  surface  polaritons  can  lose 
energy:  either  through  roughness  induced  radiation  or  through  roughness  induced  scattering 
into  different  polariton  states.  Which  mechanism  dominates  depends  on  the  polariton's 
frequency.  Thus  surface  roughness  can  affect  the  mean  free  path  length  of  polaritons 
propagating  on  the  surface  of  a  material.  These  mechanisms  have  in  fact  been  suggested  as 
explanations  of  certain  anomalies  in  attenuated  total  reflection  measurements  of  plasmon 
polaritons. Another effect due to random roughness is the shifting of frequencies of
surface polariton states below their flat surface frequencies, thus displacing the dips in
reflectivity curves by amounts dependent on the distribution of heights and profiles that
characterize  the  roughness.  Again,  this  is  also  a  typical  consequence  of  dissipation 
mechanisms. 




One  of  the  most  promising  aspects  of  surface  roughness  effects  is  the  ability  to 
couple  incident  light  with  surface  excitations.  Surface  polaritons  have  phase  velocities 
$\omega/k < c$ and so cannot directly couple with incident light. Damping in the material allows
one possible mechanism for coupling by giving a width in frequency to the susceptibility
poles. Surface modes with frequencies and wavevectors very near the light line can then be
driven in an "off-resonance" manner.

Surface  roughness  creates  a  width  in  wavelength  analogous  to  the  width  in 
frequency  produced  by  damping.  A  very  simple  type  of  roughness,  a  periodic  grating, 
clearly  illustrates  this.  As  is  well  known,  light  incident  on  a  grating  is  diffracted  into  several 
directions  according  to  the  wavelength  of  the  light  and  period  of  the  grating.  Each  direction 
corresponds to a diffracted beam with a different parallel wavevector component. The
parallel wavevector component of the scattered light is given by $k_x = k_{x0} \pm n k_g$, where $k_{x0}$ is
the parallel wavevector component of the incident wave, $k_g$ is the grating wavevector
($2\pi$ divided by the spatial period of the grating) and n is an
integer. The grating thus scatters the incident wave into an infinite number of parallel
wavevector states that differ by integral multiples of $k_g$. Many of
these wavevector states $k_x$ satisfy $k_x < \omega/c$ and so correspond to radiative states. States that
have $k_x > \omega/c$, however, decay exponentially away from the surface and are evanescent
waves. Energy in these states can couple directly with surface polariton modes and thus
provide a means of transferring energy from the incident light wave to surface
electromagnetic excitations.
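
The sorting of diffracted orders into radiative and evanescent states can be sketched as follows. The example assumes a wave incident from vacuum, takes the grating wavevector to be $2\pi$ divided by the grating period, and uses the magnitude of the parallel wavevector so that negative orders are handled in the same way as positive ones. The function name, its arguments and the numbers in the usage line are illustrative assumptions.

```python
import math


def diffracted_orders(wavelength, theta, period, n_max=3):
    """Split grating orders n = -n_max..n_max into radiative and evanescent lists.

    wavelength : vacuum wavelength of the incident light
    theta      : angle of incidence in radians
    period     : spatial period of the grating (same length unit as wavelength)
    """
    k0 = 2 * math.pi / wavelength   # omega / c in vacuum
    kx0 = k0 * math.sin(theta)      # parallel wavevector of the incident wave
    kg = 2 * math.pi / period       # grating wavevector
    radiative, evanescent = [], []
    for n in range(-n_max, n_max + 1):
        kx = kx0 + n * kg
        # |kx| < omega/c: propagating beam; otherwise the order decays away
        # from the surface and can feed surface polariton modes.
        (radiative if abs(kx) < k0 else evanescent).append(n)
    return radiative, evanescent


radiative, evanescent = diffracted_orders(wavelength=1.0, theta=0.0, period=1.5)
```

For normal incidence and a period of 1.5 wavelengths, only the orders n = -1, 0, 1 are radiative; all higher orders are evanescent and are the ones available for coupling to surface modes.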

This  process  is  analogous  to  the  scattering  of  electrons  in  periodic  potentials,  and 
one  can  construct  Brillouin  zone  representations  for  the  surface  polariton  modes  on  periodic 
gratings. This is shown schematically in figure 2.7 where grating induced "light lines"
are  superposed  on  the  dispersion  curve  for  a  surface  polariton.  Each  light  line  corresponds 
to  one  of  the  n  diffracted  beams  and  the  incident  beam  can  then  couple  to  a  surface  polariton 
everywhere  a  light  line  intersects  the  polariton  dispersion  curve. 




Another feature reminiscent of energy bands is the appearance of "band gaps" in the
surface polariton dispersion relations that exist at the Brillouin zone boundaries. Also, it is
possible that the fields very near a rough surface are enhanced to such a degree as to
significantly affect nonlinear optical processes.

By  allowing  an  incident  beam  to  couple  with  the  surface  polariton  modes,  surface 
roughness  can  enhance  the  nonreciprocal  reflectivity  of  an  antiferromagnet  in  an  applied 
field.  In  other  words,  in  an  applied  field  the  difference  in  reflectivity  between  a  beam 
incident at $+\theta$ and a beam incident at $-\theta$, both of which couple to the surface polariton
modes,  is  increased  by  the  presence  of  roughness. 

Since  leaky  modes  also  exist  in  regions  away  from  the  light  line,  roughness  can 
allow  coupling  between  an  incident  beam  and  the  Brewster-like  modes  of  a  damped 
antiferromagnet. A theory for reflection from a rough antiferromagnetic surface is presented
in  chapter  5  with  emphasis  on  coupling  to  surface  excitations.  Numerical  results  are 
presented  for  reflection  from  a  sinusoidal  grating  and  a  randomly  rough  surface. 


Figure 2.7. In analogy to energy bands for electrons in semiconductors, an extended zone
scheme can be used to show the possible excitations of surface polariton modes.
Shown here is a typical dispersion curve for surface polariton modes plotted as
frequency vs. wavenumber. The solid straight line is the dispersion curve for
light in vacuum where ω=ck. Without the grating, only those surface modes
which cross the light line ω=ck can be excited. With a grating, the light line is
split into an infinite number of parallel lines, each with slope c, spaced a
distance nk_g apart. The grating then allows the incident light to excite surface
modes anywhere a grating line intersects the surface polariton curve. A similar
figure results for light incident at negative incident angles except that all the light
lines then have negative slope.


CHAPTER 3


NONRECIPROCAL REFLECTION

Nonreciprocal reflection, where the reflectance of a material for some angle of
incidence θ differs from the reflectance at -θ, has been observed on antiferromagnets by
coupling the surface polaritons to an incident electromagnetic wave and measuring the
resulting change in reflectance.^ This actually constituted the first experimental observation
of antiferromagnetic surface polaritons. The coupling was done by means of damping
mechanisms in the material which allowed energy to dissipate from the electromagnetic wave
into the surface polariton modes. An interesting question, which is the topic of this chapter,
is whether nonreciprocal reflection can exist without absorption present in the material. It
turns out that nonreciprocal reflection with respect to incident angle can exist without the
presence of material damping by coupling the incident electromagnetic wave directly with the
bulk polariton modes.

3.1 Possibility of nonreciprocal reflectance.

Detailed balance arguments provide one means of showing the possibility (or lack
thereof) for nonreciprocal reflection. One such argument, originally due to Remer et al.,
shows that nonreciprocity in reflection is not possible without absorption in the material.^ ^
First define the reflectance, R(θ,φ,ω,T), as the power of an electromagnetic wave with
frequency between ω and ω+dω which is reflected by a surface element dF at a temperature
T. This is normalized by the power normally incident on the surface element dF in a solid
angle in the direction defined by (θ,φ) with frequency between ω and ω+dω. The angle θ is
the angle the radiation makes with respect to the normal of the surface, while φ is the angle




the reflection plane makes with the magnetic field (see figure 3.1). The direction (θ,φ)
always refers to the incident wave. The transmittance T(θ,φ,ω,T) and absorptance
A(θ,φ,ω,T) are defined similarly.

Now  imagine  a  blackbody  and  the  material  interchanging  radiation  at  temperature  T. 
At  equilibrium,  the  conservation  of  energy  and  detailed  balance  result  in  the  equality 


$$A(\theta)+R(\theta)=A(-\theta)+R(-\theta)=1 \tag{3.1}$$


It  would  thus  seem  that  nonreciprocal  reflection  is  possible  only  in  the  presence  of 
nonreciprocal absorption; that is, R(θ)-R(-θ)≠0 if A(θ)≠A(-θ). It has been pointed out,
however, that this is true only for the net incident and reflected power, but not necessarily for
the  individual  polarizations  of  the  incident  and  reflected  waves.^ 

The  argument  can  be  restated  in  more  general  form  by  defining  separate  reflectances, 
absorptances  and  transmittances  for  each  polarization  and  considering  sums  over  all 
polarization  states.  In  the  following,  the  polarization  state  of  the  incident  energy  is  denoted 
by  the  subscript  i  and  the  polarization  state  of  the  corresponding  reflected  or  transmitted 
energy is denoted by the subscript e. Also, the absorptance in each state, A_ie, is assumed to
be  zero. 


Consider  an  infinite  nonabsorbing  slab  surrounded  on  both  sides  by  infinite 
blackbodies.  Suppose  both  blackbodies  and  the  slab  are  at  equilibrium  with  uniform 
temperature T. Each blackbody radiates energy I_i(θ) in the direction θ with polarization i.
For this geometry, it is necessary to define separate reflectances and transmittances for each
surface. These are labelled T_1ie, T_2ie, R_1ie, and R_2ie as illustrated in figure 3.2(a). The
numeric subscript on any quantity identifies which surface the corresponding incident energy
illuminates.




Figure 3.1. Reflection geometry. The material occupies the upper half plane, y>0, and the
surface is in the xz plane. The saturation magnetizations and applied field lie
along the z axis. The angle of incidence, θ, is the angle the wavevector of the
incident wave makes with the normal to the surface. The plane of incidence
makes an angle φ with respect to the z axis. The incident magnetic fields H_s^i and
H_p^i represent the components of the incident magnetic field perpendicular and
parallel to the plane of incidence. The reflected magnetic fields H_s^r and H_p^r are
defined similarly. Note that for φ≠90°, there are two transmitted waves which
are denoted by the wavevectors k_1 and k_2 with magnetic fields H_1 and H_2.




Figure 3.2. The net transmitted energy emitted by the slab from surface 1 is T_1 and the net
energy reflected by surface 1 is R_1. T_2 and R_2 are defined similarly. The
angles θ and -θ refer to the angle of incidence.




T_1ie(θ) and R_1ie(θ) represent the transmitted and reflected energy, with polarization e, due
to the energy incident on surface 1 at an angle θ with polarization i. T_2ie(θ) and R_2ie(θ) are
the transmitted and reflected energy with polarization e, due to the energy incident on surface
2 at an angle θ with polarization i.

In order to make use of the symmetry of this problem, the energy incident in the +θ
direction on surface 1 illuminates the slab from the left. The corresponding R_1ie(θ)
leaves the surface toward the right. Similarly, the energy incident in the +θ direction on
surface 2 is defined as coming from the right. The corresponding R_2ie(θ) leaves the
surface toward the left. Thermal equilibrium requires the energy incident on surface 1 from
the upper blackbody to equal the energy incident on surface 2 from the lower blackbody. With
these definitions, R_1ie(θ)=R_2ie(θ), and likewise for the transmittances. The subscripts
identifying the surface are no longer necessary and are dropped for simplicity.

The  blackbodies  radiate  energy  in  all  polarization  states  equally.  For  equilibrium  to 
exist  in  all  polarization  states,  detailed  balance  requires  the  sum  over  all  incident  polarization 
states of the energy emitted with the polarization state e into the direction θ equal the energy
in the same polarization state e incident in the direction θ. The energy emitted into the
direction θ is composed of reflected energy (from energy incident in the direction -θ) and
transmitted energy (from energy incident from the direction θ). Thus:


i 

(3.2) 

Similarly, the energy emitted into the direction -θ (reflection of energy incident in the
direction θ and transmission of energy incident in the direction -θ) with polarization e
summed over all incident polarization states must equal the energy with polarization e
incident in the direction -θ. The energy incident at -θ with polarization e equals the energy
incident at +θ with polarization e and the following equality must hold:




$$\sum_i \left(R_{ie}(\theta)+T_{ie}(-\theta)\right)=\sum_i \left(R_{ie}(-\theta)+T_{ie}(\theta)\right) \tag{3.3}$$

This equality is shown schematically in figure 3.2(b) for the polarization states r and l.

A second equality can also be obtained. In figure 3.2(c) the energy emitted by the
slab, due to energy with polarization l incident in the direction +θ, is shown. The emitted
energy has both polarizations r and l. A similar picture can be drawn for energy emitted due
to energy in state l incident in the direction -θ. The blackbody emits power equally in all
directions and polarizations, so I_l(+θ)=I_l(-θ). For the arbitrary incident state i, the
following equality thus holds:


$$\sum_e \left(R_{ie}(-\theta)+T_{ie}(-\theta)\right)=\sum_e \left(R_{ie}(\theta)+T_{ie}(\theta)\right) \tag{3.4}$$

The  two  relations  (3.3)  and  (3.4)  have  comparable  forms  when  equation  (3.3)  is 
summed  over  emitted  states  e  and  equation  (3.4)  is  summed  over  incident  states  i.  In  order 
for  these  two  relations  to  be  consistent,  then 


$$\sum_{i,e} \left(T_{ie}(\theta)-T_{ie}(-\theta)\right)=0 \tag{3.5}$$

The  requirement  that  the  total  reflectance  due  to  all  incident  polarization  states  is  reciprocal 
with  respect  to  incident  angle  still  holds: 

$$\sum_{e,i} \left(R_{ie}(\theta)-R_{ie}(-\theta)\right)=0 \tag{3.6}$$
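The consistency step can be spelled out as follows. This is a sketch of the algebra only, reading (3.3) as the balance of energy emitted into -θ and +θ for each emitted polarization e, and (3.4) as the balance of energy emitted for each incident polarization i. The shorthand S_R(θ) and S_T(θ) for the double sums of R_ie(θ) and T_ie(θ) over i and e is introduced here and does not appear in the original text.

```latex
\begin{aligned}
\text{(3.3) summed over } e:&\quad S_R(\theta)+S_T(-\theta)=S_R(-\theta)+S_T(\theta)
  \;\Rightarrow\; S_R(\theta)-S_R(-\theta)=S_T(\theta)-S_T(-\theta),\\
\text{(3.4) summed over } i:&\quad S_R(-\theta)+S_T(-\theta)=S_R(\theta)+S_T(\theta)
  \;\Rightarrow\; S_R(\theta)-S_R(-\theta)=-\bigl[S_T(\theta)-S_T(-\theta)\bigr],\\
\text{hence}:&\quad S_T(\theta)-S_T(-\theta)=0
  \quad\text{and}\quad S_R(\theta)-S_R(-\theta)=0 .
\end{aligned}
```

The two differences must be equal and opposite at the same time, so each vanishes separately. Only the totals over all polarization states are constrained; individual terms R_ie may still differ between +θ and -θ.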




To simplify the notation, the quantity ΔR_i = Σ_e (R_ie(θ) - R_ie(-θ)) is defined. If there are
only two possible polarizations (say i=r and i=l), and ΔR_r(θ)≠0, equation (3.6) is still
satisfied if

$$\Delta R_r(\theta)=-\Delta R_l(\theta) \tag{3.7}$$

This  means  reflection  within  one  polarization  can  be  nonreciprocal  if  reflection  within  at  least 
one  of  the  other  polarizations  is  nonreciprocal.  This  equality  will  be  demonstrated  by  a 
specific  example  in  section  3.3. 
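The bookkeeping behind (3.6) and (3.7) can be checked numerically. The sketch below is a minimal illustration in which the reflectance tables are hypothetical numbers chosen only to satisfy the constraint; they are not results from the thesis, and the function names are assumptions made for this example.

```python
def delta_R(R_plus, R_minus):
    """Return Delta R_i = sum_e [R_ie(+theta) - R_ie(-theta)] for each incident state i.

    R_plus[i][e] and R_minus[i][e] hold R_ie(+theta) and R_ie(-theta).
    """
    return {
        i: sum(R_plus[i][e] - R_minus[i][e] for e in R_plus[i])
        for i in R_plus
    }

def total_is_reciprocal(R_plus, R_minus, tol=1e-12):
    """Check the constraint (3.6): the sum of Delta R_i over incident states vanishes."""
    return abs(sum(delta_R(R_plus, R_minus).values())) < tol

if __name__ == "__main__":
    # Hypothetical reflectances for incident states r and l (illustrative only).
    R_plus = {"r": {"r": 0.30, "l": 0.05}, "l": {"r": 0.05, "l": 0.10}}
    R_minus = {"r": {"r": 0.20, "l": 0.05}, "l": {"r": 0.05, "l": 0.20}}
    print(delta_R(R_plus, R_minus))
    print(total_is_reciprocal(R_plus, R_minus))
```

In this example each circular state is reflected nonreciprocally, while the two contributions cancel in the total, which is the situation described by (3.7).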

The remainder of this chapter is organized in the following manner. In section 3.2 a
theory  is  constructed  for  the  reflectance  and  transmittance  of  an  electromagnetic  wave 
incident  on  a  semi-infinite  antiferromagnet  in  the  presence  of  an  applied  field.  This  is  done 
for  arbitrary  angles  of  incidence  and  arbitrary  polarizations  of  the  incident  wave.  In  section 
3.3,  specific  results  are  presented  from  numerical  calculations  of  reflection  from  MnF2,  a 
uniaxial  antiferromagnet,  which  are  used  to  show  that  nonreciprocal  reflection  can  occur  for 
certain  polarizations  of  the  incident  wave.  Linearly  polarized  incident  waves  are  found  to  be 
reflected reciprocally with respect to the angle of incidence θ, but circularly polarized incident
waves  are  not.  It  will  be  seen  that  the  nonreciprocity  of  the  circular  reflectances  will  satisfy 
the  equality  (3.7). 




3.2  Theory. 

The geometry is defined in figure 3.1. The material occupies the y>0 half space
with the easy axis along z. An applied field also lies along the z axis. The plane of incidence
makes an angle φ with the z axis and the incident wavevector k makes an angle θ with respect
to the outward normal from the surface. The incident wavevector is k^i = (k_x^i, k_y^i, k_z^i)
and the incident magnetic field is H^i. The reflected wavevector is k^r = (k_x^r, k_y^r, k_z^r) and the
magnetic field is H^r.

The dielectric properties of the material are determined by ε and μ as defined by
equations (2.3) and (2.9). Plane wave solutions of the form e^{i(k·x-ωt)} are assumed. In
explicit matrix form, equation (2.11) then becomes


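[Equation (3.8): the matrix form of the three coupled equations of motion for H_x, H_y and H_z. The matrix elements are not legible in the scanned source.]

(3.8)

k_1 and k_2 are given by

[Equations (3.9) and (3.10) are not legible in the scanned source.]

(3.9)

(3.10)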




The dispersion relation of bulk polaritons propagating in the material is found by
setting the determinant of this matrix to zero. In the special case k_z=0, the matrix simplifies
considerably and an explicit expression for k_y(ω,k_x) is obtained:


(3.11) 

This equation describes the propagation of bulk polaritons in a direction perpendicular to the
easy axis.^^ Note that there is only one value of k_y for each k_x and ω.

When k_z≠0, the determinant of the matrix in (3.8) is a second order polynomial in
(k_y)² so that k_y has two possible magnitudes at each frequency. A wave incident on the
material can then transmit energy into two bulk polariton modes, each mode travelling in a
different direction but with the same frequency as the incident wave.

Outside the material, where y < 0, the susceptibilities are uniform and describe a
vacuum:


(3.12) 


(3.13) 


Substitution  of  (3.12)  and  (3.13)  into  the  wave  equation  (3.7)  gives  the  usual  wave  equation 
for H in free space. With plane wave solutions of the form e^{i(k·x-ωt)} the free space
dispersion relation is simply ω_0 = k^i. Similarly, the reflected wavevector obeys ω_0 = k^r.

The  parallel  components  of  the  wavevector  must  match  across  the  interface  between 
the  vacuum  and  the  material.  Thus 


$$k_x = k_x^r = k_x^i \tag{3.14}$$

$$k_z = k_z^r = k_z^i \tag{3.15}$$


These two relations plus ω_0 = k^i = k^r allow us to write k^r = (k_x, -k_y^i, k_z).
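A short sketch of this wavevector matching is given below. The parametrization of the incident wavevector by θ and φ is an assumption based on the geometry of figure 3.1 (normal component ω_0 cos θ pointing into the material at y>0, in-plane component ω_0 sin θ at an angle φ from the z axis); the function names are illustrative.

```python
import math

def incident_wavevector(omega0, theta_deg, phi_deg):
    """Incident wavevector (k_x, k_y, k_z) in vacuum, with |k| = omega0 (assumed geometry)."""
    theta = math.radians(theta_deg)
    phi = math.radians(phi_deg)
    return (
        omega0 * math.sin(theta) * math.sin(phi),  # k_x, parallel to the surface
        omega0 * math.cos(theta),                  # k_y, normal to the surface
        omega0 * math.sin(theta) * math.cos(phi),  # k_z, along the easy axis
    )

def reflected_wavevector(k_inc):
    """Reflected wavevector: parallel components preserved, normal component reversed."""
    kx, ky, kz = k_inc
    return (kx, -ky, kz)

def magnitude(k):
    return math.sqrt(sum(component * component for component in k))

if __name__ == "__main__":
    k_i = incident_wavevector(1.0, 30.0, 45.0)
    k_r = reflected_wavevector(k_i)
    print(k_i, k_r, magnitude(k_i), magnitude(k_r))
```

Both wavevectors have the same magnitude, as the free space dispersion relation requires, and share the same components parallel to the surface.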

The  amplitudes  of  the  transmitted  and  reflected  waves  can  be  found  in  terms  of  the 
incident  wave  by  applying  the  usual  electromagnetic  boundary  conditions.  The  existence  of 
two waves in the material (corresponding to the two k_y's) means that there are three
undetermined fields in the problem: H^r, and two transmitted fields H_1 and H_2. In terms of
components, this means there are nine amplitudes to be related through boundary conditions
at y=0.


The usual Maxwell boundary conditions on tangential H, normal B, and tangential E
provide five independent equations. Four more equations are needed to determine the fields.
It turns out that the boundary condition on continuity of normal D across the interface can be
expressed as a linear combination of the boundary conditions on tangential H, so this does
not yield any new relations. Instead, the remaining four equations are obtained from the
equations of motion (3.8).

For arbitrary directions of propagation, the equations of motion (3.8) couple the
components of the H fields in the material. This means two of these three equations can be
used to write H_x and H_y in terms of H_z. Since there are two H fields in the material, there
are four new relations involving the amplitude of the fields (two equations for each k_y).
Writing

$$H_x = A H_z, \qquad H_y = B H_z \tag{3.16}$$




A matrix equation for the coefficients A and B is obtained from (3.8):


(3.17) 

(Note that any two of the three equations in (3.8) could have been chosen.) One should
emphasize that each k_y results in a unique set of coefficients A and B. These two sets are
labelled by the subscripts "1" and "2": k_y1, A_1, B_1, and k_y2, A_2, B_2.

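The five Maxwell boundary conditions are next written in the above notation.
Continuity of tangential H across y=0 gives

$$H_x^i + H_x^r - A_1 H_{z1} - A_2 H_{z2} = 0 \tag{3.18}$$

$$H_z^i + H_z^r - H_{z1} - H_{z2} = 0 \tag{3.19}$$

Continuity of normal B gives

$$H_y^i + H_y^r + i\mu_2\left(A_1 H_{z1} + A_2 H_{z2}\right) - \mu_1\left(B_1 H_{z1} + B_2 H_{z2}\right) = 0 \tag{3.20}$$

Continuity of tangential E, written in terms of the H fields via c∇×H = ∂D/∂t, gives the two
relations

$$k_y^i\left(H_z^i - H_z^r\right) - k_z\left(H_y^i + H_y^r\right) - \left(k_{y1}H_{z1} + k_{y2}H_{z2} - k_z B_1 H_{z1} - k_z B_2 H_{z2}\right)/\epsilon_1 = 0 \tag{3.21}$$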




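$$k_x\left(H_y^i + H_y^r\right) - k_y^i\left(H_x^i - H_x^r\right) - \left(k_x B_1 H_{z1} + k_x B_2 H_{z2} - k_{y1}A_1 H_{z1} - k_{y2}A_2 H_{z2}\right)/\epsilon_1 = 0 \tag{3.22}$$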


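These five equations are sufficient to determine H_1 and H_2 for a given ω, φ
and θ. For a numerical solution it is convenient to rewrite equations (3.18-3.22) in matrix
form:

$$
\begin{pmatrix}
1 & 0 & 0 & -A_1 & -A_2 \\
0 & 0 & 1 & -1 & -1 \\
0 & 1 & 0 & i\mu_2 A_1-\mu_1 B_1 & i\mu_2 A_2-\mu_1 B_2 \\
0 & -k_z & -k_y^i & -(k_{y1}-k_z B_1)/\epsilon_1 & -(k_{y2}-k_z B_2)/\epsilon_1 \\
k_y^i & k_x & 0 & -(k_x B_1-k_{y1}A_1)/\epsilon_1 & -(k_x B_2-k_{y2}A_2)/\epsilon_1
\end{pmatrix}
\begin{pmatrix} H_x^r \\ H_y^r \\ H_z^r \\ H_{z1} \\ H_{z2} \end{pmatrix}
=
\begin{pmatrix} -H_x^i \\ -H_z^i \\ -H_y^i \\ k_z H_y^i - k_y^i H_z^i \\ k_y^i H_x^i - k_x H_y^i \end{pmatrix}
\tag{3.23}
$$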


A numerical solution goes as follows: first those k_y1 and k_y2 that cause the
determinant of (3.8) to vanish are found. Note that k_y1 and k_y2 must have positive real
parts in order to represent waves travelling away from the surface. Next, the matrix
equation (3.17) is solved for the vectors containing A and B. This is done once for k_y1 and
once for k_y2, resulting in values for A_1, B_1, A_2, and B_2. The matrix equation (3.23) is
then solved for the vector containing H^r, H_z1 and H_z2. Finally, equation (3.16) is used to
determine the x and y components of the two transmitted fields.
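The first step of this procedure, choosing the physical roots, can be sketched as follows. The example assumes the determinant condition has already been reduced to a quadratic a u² + b u + c = 0 in u = (k_y)², with hypothetical coefficients a, b and c standing in for the actual matrix elements of (3.8). The treatment of purely imaginary roots (keeping the decaying solution) is also an assumption of this sketch.

```python
import cmath

def outgoing_root(ky_squared):
    """Square root of (k_y)^2 with a non-negative real part (wave leaving the surface)."""
    root = cmath.sqrt(ky_squared)  # the principal root already has real part >= 0
    if root.real == 0 and root.imag < 0:
        # Purely imaginary k_y: keep the solution that decays into y > 0 (assumption).
        root = -root
    return root

def ky_roots(a, b, c):
    """Solve a*u**2 + b*u + c = 0 for u = k_y**2 and return the two outgoing k_y values."""
    discriminant = cmath.sqrt(b * b - 4 * a * c)
    u1 = (-b + discriminant) / (2 * a)
    u2 = (-b - discriminant) / (2 * a)
    return outgoing_root(u1), outgoing_root(u2)

if __name__ == "__main__":
    # Hypothetical coefficients, chosen so that k_y**2 = 4 and k_y**2 = 1.
    print(ky_roots(1.0, -5.0, 4.0))
```

With material damping included the coefficients become complex, and the same selection rule applies to the complex roots.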




The  last  task  is  to  calculate  reflectances  and  transmittances.  The  reflectance  is 
defined  as  the  ratio  of  the  reflected  energy  flux  normal  to  the  surface  to  the  incident  energy 
flux normal to the surface. Using the Poynting vector S=(E×H*)/4πc this is simply

$$R = \frac{|\mathbf{H}^r|^2}{|\mathbf{H}^i|^2} \tag{3.24}$$

The transmittance is defined as the ratio of the transmitted energy flux normal to the surface
to the incident energy flux normal to the surface. Since the material is gyrotropic, the energy
flow is not in the direction of propagation. Also, since there are two waves in the material,
there are also two transmittances. The transmittance then has the more complicated form:

fijky 

(3.25) 

The subscript "n" can have the value 1 or 2 and identifies the corresponding transmitted field
and wavevector according to the previous convention.

It is convenient for the following discussion to represent the incident and reflected H
fields in terms of components parallel and perpendicular to the plane of incidence. The
incident field's parallel component is defined as H_p^i and its perpendicular component is
defined as H_s^i. The transformation from incident parallel and perpendicular components to
cartesian components is given by

$$H_x^i = H_p^i\cos\theta\,\sin\phi + H_s^i\cos\phi \tag{3.26}$$

$$H_y^i = -H_p^i\sin\theta \tag{3.27}$$

and

$$H_z^i = H_p^i\cos\theta\,\cos\phi - H_s^i\sin\phi \tag{3.28}$$


Since the theory calculates the reflected fields in terms of their cartesian components,
the reverse transformation is required. Again referring to the geometry of figure 3.1, the
parallel (H_p^r) and perpendicular (H_s^r) components of the reflected fields are given by

$$H_s^r = H_x^r\cos\phi - H_z^r\sin\phi \tag{3.29}$$

$$H_p^r = -H_y^r/\sin\theta \tag{3.30}$$

3.3  Results. 

The preceding theory is now applied to the specific example of reflection from the
antiferromagnet MnF2. This is a uniaxial antiferromagnet with the parameters H_ex=550 kG,
H_a=7.87 kG, and M_s=0.6 kG. Damping is ignored. The following results are presented in
the unitless variables ω/Ω and ck/Ω where Ω is the antiferromagnetic resonance frequency
defined in Chapter 2.

The  primary  goal  is  to  establish  conditions  for  nonreciprocal  reflection  with  respect 
to the incident angle θ; i.e., conditions such that R(θ)≠R(-θ). As discussed in the
introduction, the two propagation directions +k and -k (corresponding to +θ and -θ) are
equivalent when there is no applied field. It is only when an applied field is present that +k
and -k propagation directions can lead to nonreciprocal properties. A static external field of
0.3 kG is thus applied in the z direction.

In this applied field, electromagnetic waves can propagate in MnF2 if they have
frequencies below ω/Ω=0.998, between ω/Ω=1.002 and ω/Ω=1.009, or above ω/Ω=1.012.
These frequency regions are often referred to as the bulk polariton bands and are defined as
the regions where the normal component of the wavevector in the material (equation 2.5) is
real.
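For quick reference, the band edges quoted above can be wrapped in a small lookup. This is only a convenience sketch: the edges are the rounded values given in the text for a 0.3 kG applied field, and the function name and the treatment of the edges themselves (excluded) are assumptions.

```python
import math

# Bulk polariton bands in units of omega/Omega, as quoted for a 0.3 kG applied field.
BULK_BANDS = [
    ("lower", 0.0, 0.998),
    ("middle", 1.002, 1.009),
    ("upper", 1.012, math.inf),
]

def bulk_band(omega_over_Omega):
    """Return the name of the bulk band containing the frequency, or None if it lies in a gap."""
    for name, low, high in BULK_BANDS:
        if low < omega_over_Omega < high:
            return name
    return None

if __name__ == "__main__":
    for w in (0.990, 0.995, 1.000, 1.006, 1.010, 1.015):
        print(w, bulk_band(w))
```

The frequencies used in the figures of this section (0.990, 0.995, 1.006 and 1.015) all fall inside one of the three bands.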

In figure 3.3, the reflectance R and the two transmittances T_1 and T_2 are plotted as
functions of incident angle for a linearly polarized incident wave. The incident H field lies in
the plane defined by [...] and the frequency is ω/Ω=0.990, well within the lower bulk
polariton band. The plane of incidence makes an angle φ=45° from the z axis. Note that
most of the transmitted energy has gone into the T_2 bulk mode although at angles near
θ=90°, the T_1 bulk mode is also excited.

Most importantly, however, R(θ)=R(-θ) so the reflectance is reciprocal. This
reciprocity occurs for all angles φ and all orientations of the plane of polarization of the
incident wave. The only difference for different orientations of the plane of polarization of
the incident wave is in the relative amounts of energy transmitted into the T_1 and T_2 bulk
modes. T_1 has its maximum for the above case where [...] and T_2 is maximum for the
case [...].

A very different situation occurs when the incident wave is circularly polarized. In
figure 3.4, ΔR=R(θ)-R(-θ) is shown for a left circularly polarized incident wave where
H_p^i=iH_s^i. The frequency is ω/Ω=0.990 and the applied field is 0.3 kG. The reflectance is
clearly nonreciprocal with respect to θ. The magnitude of ΔR increases with θ until it reaches
a maximum near φ_0=75°. At θ=90°, the incident wave travels parallel to the surface and R
vanishes. Consequently, beyond φ_0, ΔR must also tend to zero.

This nonreciprocal behavior can be understood through the following
considerations. First, recall that magnetic waves in the material drive the magnetizations of
the sublattices through Bloch's equations of motion. This results in a precession of the
magnetizations about the z axis and gives rise to magnetic fields which rotate in the xy plane.
One might then expect this rotation to affect the polarization of the bulk modes.


Figure 3.3. Reflectance and transmittance for a linearly polarized wave incident at
φ=45° as functions of incident angle θ. The frequency is ω/Ω=0.990. The two
transmittances, T_1 and T_2, correspond to the two transmitted waves. The
reflectance for +θ is identical to that of -θ.


This  in  turn  may  mean  the  bulk  modes  are  most  easily  excited  by  fields  with  certain 
polarizations,  thus  leading  to  a  dependence  of  the  transmittance  on  the  polarization  of  the 
driving, or incident, wave. This is pursued by examining the polarization of the bulk
polariton  modes  through  the  equations  of  motion  (3.8). 

Consider propagation in the yz plane where k_x=0. As long as H_0≠0, the three
components of the bulk fields H_x, H_y, and H_z are coupled together through μ_2. The
components are related as follows:


and 


(3.32) 


z 


The quantities ρ(k_z) and η(k_z) are given by


(3.33) 


(E,/ej)k;+kj 


and 


(3.34) 


(3.35) 




The fields of a bulk mode in the vector form are

$$\mathbf{H}(k_z) = \left(i\rho(k_z)\eta(k_z)/k_z,\; \eta(k_z)/k_z,\; 1\right) H_z(k_z) \tag{3.36}$$

This represents an elliptically polarized wave whose magnetic field rotates in both the xy and
xz planes. Now let k_z go to -k_z and observe the resulting polarization.

First, note that the normal component of the wavevector, k_y, is determined by
setting the determinant of the equations of motion equal to zero, as before. This results in a
polynomial which is a quadratic in k_z². Thus k_y(k_z)=k_y(-k_z). The coefficients ρ(k_z) and
η(k_z) depend only on k_z² and k_y, so ρ(k_z)=ρ(-k_z) and η(k_z)=η(-k_z). Finally, from
equation (3.28) it is seen that H_z is unchanged as θ goes to -θ (which is equivalent to letting
k_z go to -k_z). The resulting bulk H field is

$$\mathbf{H}(-k_z) = \left(-i\rho(k_z)\eta(k_z)/k_z,\; -\eta(k_z)/k_z,\; 1\right) H_z(k_z) \tag{3.37}$$

The wave is still elliptically polarized but the direction of rotation in the xz plane is reversed
from H(k_z). The direction of rotation in the xy plane, however, is the same as that of H(k_z).

To see how this might lead to nonreciprocal transmission and reflection, suppose the
incident wave is travelling in the yz plane with +k_z and is circularly polarized with x and y
components H_x^i=1 and H_y^i=i. An identical circularly polarized wave travelling in the yz
plane but with -k_z would need to have H_x^i=1 and H_y^i=-i in order to preserve the sense of
rotation of the field about the wave's direction of propagation.

An incident wave with a given sense of rotation should find it easier to excite waves
in the material that have the same sense of rotation. A measure of the incident wave's ability
to excite bulk modes in the material is the magnitude of the product (H^i)*·H.




Using the above circularly polarized incident wave and arbitrarily setting the z
component to unity for both the +k_z and -k_z directions, one obtains for the +k_z direction:


(3.38) 

For the -k_z direction, the corresponding expression is:


(3.39) 

Suppose ρ is negative for one of the transmitted waves. Equation (3.38) is then less than
(3.39) and the transmittance into this wave should be greater for incidence with -k_z than with
+k_z. If the other bulk polariton also has negative ρ, or simply has smaller magnitudes for ρ
and η, then the reflectance will be greatest for incidence with +k_z.

In the example of figure 3.4, an examination of the numerically calculated A and B
coefficients shows that the bulk polariton modes corresponding to each value of k_y have
opposite senses of rotation and different magnitudes in the xy plane. The bulk polariton
with the largest x and y components is in this case right circularly polarized for -k_z. Thus
the transmittance is greater for +k_z than -k_z and the reflectance is less for +θ than for -θ.

On the other hand, a right circularly polarized incident wave should reverse the
situation and transmit more energy into the bulk modes at -k_z than at +k_z. This is seen in
figure 3.5 where ΔR=R(θ)-R(-θ) is shown for a right circularly polarized incident wave at
ω/Ω=0.990 and φ=45°. Here ΔR is positive, indicating a greater transmittance at -θ than at
+θ.


Figure 3.5. The change in reflectance, ΔR(θ)=R(θ)-R(-θ), for a right circularly polarized
wave as a function of incident angle θ. The frequency is ω/Ω=0.990. Here ΔR(θ) is
positive implying that the reflectance for -θ is less than that of +θ.


From figures 3.4 and 3.5 it is clear that the ΔR(θ) for left circularly polarized incident light is
exactly equal and opposite to the ΔR(θ) for right circularly polarized incident light. This is
precisely the prediction of the thermodynamic argument of section 3.1. Since linearly
polarized light consists of equal amounts of right and left circularly polarized light, the net
change in reflectance, ΔR_r(θ)+ΔR_l(θ), must vanish by the same argument. This explains
why the linearly polarized incident wave of figure 3.3 has the same reflectance for +θ and
-θ.

If the nonreciprocity of a circularly polarized incident wave depends on the
magnitude of its x and y field components, then ΔR should depend on φ. The magnitudes of
these components are largest for small φ and large θ. In figure 3.6 ΔR for φ=10°, 45°, and
80° is plotted for a right circularly polarized incident wave of frequency ω/Ω=0.995. It is
clear that ΔR increases as φ approaches 0. Also, in all three cases ΔR is largest for large θ.

Note that ΔR is much larger for all three of the cases of figure 3.6 due to the higher
frequency of the incident wave. The frequency ω/Ω=0.995 is much nearer the
antiferromagnetic resonance frequency so μ_2 is much larger than at ω/Ω=0.990. This means
an increased coupling between H_x and H_y for ω/Ω=0.995 and a consequent increase in the
nonreciprocal behavior of the reflectance. In regions where μ_1 and μ_2 are large, the rotation
direction of the bulk modes as well as their magnitudes in the xy planes can change
drastically. This can allow a wave incident at one +θ to drive one of the bulk modes strongly
while at another still positive θ, the other bulk mode is driven strongly.

The fact that the two transmitted waves have opposing directions of rotation, and
that their magnitudes depend on frequency, allows the change in reflectance, ΔR, to change
sign at different frequencies and angles of incidence. In figure 3.7 ΔR is plotted for right
circularly polarized light at ω/Ω=1.006 and ω/Ω=1.015 with φ=45°. In both cases, μ_1 and
μ_2 are relatively large so that the nonreciprocal reflection is also well pronounced. Here note
the changing sign of ΔR. At small θ, ΔR is small and positive and at large θ, ΔR is large
and negative.


Figure 3.6. The change in reflectance, ΔR(θ)=R(θ)-R(-θ), for a right circularly polarized
wave (H_s^i=iH_p^i) incident at φ=10°, 45°, and 80° as functions of incident
angle θ. The frequency is ω/Ω=0.995. ΔR(θ) is largest for incident waves
travelling along the direction of the magnetic field. Note that ΔR(θ) is much
larger at this frequency than at ω/Ω=0.990.


Figure 3.7. The change in reflectance, ΔR(θ)=R(θ)-R(-θ), for a right circularly polarized
wave (H_s^i=iH_p^i) incident at φ=45° as a function of incident angle θ. The
frequencies are ω/Ω=1.006 and ω/Ω=1.015. These two frequencies lie within
the middle and upper bulk polariton bands, respectively. Note that ΔR(θ) has
both positive and negative values for each case, corresponding to the
complicated behavior of the bulk modes near the antiferromagnetic resonance
frequencies.


The strong nonreciprocity in R coincides with a strong nonreciprocity in the
transmittances. This is illustrated in figure 3.8 where R, T1 and T2 are plotted as functions
of θ for the ω/Ω = 1.006 case of figure 3.7. The relative magnitudes of the two transmitted
waves are reversed as θ goes to -θ, indicating the opposite rotation directions of the two
waves.

The large difference between the magnitudes of the two transmittances of figure 3.8
indicates that the largest ΔR should be found by examining separately the left and right
circularly polarized components of the reflected wave. If the reflected wave H^r = (H_s^r, H_p^r)
is written in terms of right and left handed circularly polarized unit vectors r and l, then a
simple transformation from the (s,p) basis to the (r,l) basis yields

$$\mathbf{H}^r = a\,\hat{\mathbf{r}} + b\,\hat{\mathbf{l}} \tag{3.40}$$

The squared magnitudes of the complex coefficients a and b are the reflectances of the right
and left circularly polarized components. These are given by

$$R_r = \frac{a^*a}{a^*a + b^*b} = \frac{|H_s^r|^2 + |H_p^r|^2 + 2|H_s^r||H_p^r|\sin(\alpha_p - \alpha_s)}{2|H^r|^2} \tag{3.41}$$

and

$$R_l = \frac{b^*b}{a^*a + b^*b} = \frac{|H_s^r|^2 + |H_p^r|^2 - 2|H_s^r||H_p^r|\sin(\alpha_p - \alpha_s)}{2|H^r|^2} \tag{3.42}$$

The phases α_p and α_s are defined by

$$\tan\alpha_p = \mathrm{Im}(H_p^r)/\mathrm{Re}(H_p^r) \tag{3.43}$$

$$\tan\alpha_s = \mathrm{Im}(H_s^r)/\mathrm{Re}(H_s^r) \tag{3.44}$$
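The decomposition in equations (3.41) and (3.42) can be checked numerically. The following is a minimal sketch, not the thesis's own code. The function names are hypothetical, and the projection a = (H_s - iH_p)/√2, b = (H_s + iH_p)/√2 is an assumed basis convention, chosen only because it reproduces the sign of the sine term in (3.41).

```python
import numpy as np

def circular_reflectances(h_s: complex, h_p: complex) -> tuple[float, float]:
    """Return (R_r, R_l) using the phase form of equations (3.41) and (3.42)."""
    abs_s, abs_p = abs(h_s), abs(h_p)
    # np.angle gives the phase whose tangent is Im/Re, as in (3.43) and (3.44).
    alpha_s, alpha_p = np.angle(h_s), np.angle(h_p)
    total = abs_s**2 + abs_p**2                      # |H^r|^2
    cross = 2.0 * abs_s * abs_p * np.sin(alpha_p - alpha_s)
    return (total + cross) / (2.0 * total), (total - cross) / (2.0 * total)

def circular_reflectances_by_projection(h_s: complex, h_p: complex) -> tuple[float, float]:
    """Same quantities from explicit coefficients a and b (assumed convention)."""
    a = (h_s - 1j * h_p) / np.sqrt(2.0)
    b = (h_s + 1j * h_p) / np.sqrt(2.0)
    norm = abs(a)**2 + abs(b)**2
    return abs(a)**2 / norm, abs(b)**2 / norm

# Arbitrary illustrative amplitudes, not values from the thesis.
r_r, r_l = circular_reflectances(0.3 + 0.4j, -0.2 + 0.9j)
p_r, p_l = circular_reflectances_by_projection(0.3 + 0.4j, -0.2 + 0.9j)
print(r_r, r_l, p_r, p_l)
```

Both routes give the same pair of values, and the two reflectances always sum to one, which is why a large nonreciprocity in one circular component can dominate ΔR.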

Figure 3.8. Reflectance and transmittance for a right circularly polarized wave (H_s^i = iH_p^i)
incident at φ = 45° as functions of incident angle θ. The frequency is
ω/Ω = 1.006. The two transmittances, T1 and T2, correspond to the two
transmitted waves. T1 is much larger than T2 for -θ while T2 is larger than T1
for +θ. The net transmittance, and hence reflectance, is still nonreciprocal,
however.


A comparison of ΔR_r(θ) = R_r(θ) - R_r(-θ) and ΔR_l(θ) = R_l(θ) - R_l(-θ) shows that the
nonreciprocity is due largely to scattering within one polarization. At ω/Ω = 1.006, for
example, ΔR_l(θ) is nearly zero for all θ and ΔR(θ) ≈ ΔR_r(θ).

The primary purpose of this chapter has been to show that nonreciprocal reflection
can exist without absorption by the material. In practice, absorption is of
course always present to some degree, and it is of interest to examine the effects of damping
on the nonreciprocity of the reflectance. When Bloch damping is included, it is found that
ΔR is increased a small amount. The most noticeable effect, however, is the shifting of the
angle where ΔR = 0 for frequencies where ΔR changes sign. For a damping value of
1/Ωτ = .0002, ω/Ω = 1.006 and an applied field of .3 kG, this shift is 3°.


CHAPTER  4 


POLARITONS WITH DAMPING

The  goal  of  this  chapter  is  to  discuss  the  effects  of  damping  on  polariton  modes  in 
antiferromagnets.  There  are  several  reasons  for  this.  Damping  often  plays  a  critical  role  in 
the  interaction  of  external  probes  with  polaritons.  For  example,  in  the  reflection  experiment 
mentioned  earlier,  damping  is  necessary  for  a  coupling  of  the  external  electromagnetic 
radiation  to  the  surface  polariton  modes.  Also,  as  discussed  in  chapter  2,  not  only  does 
damping  have  a  significant  impact  on  the  surface  polariton  modes,  but  the  inclusion  of 
damping  also  leads  to  the  existence  of  new  "leaky"  surface  modes. 

This chapter begins with the effects of damping on the semi-infinite
antiferromagnetic polariton curves and discusses the properties of the magnetic Brewster
mode. The effects on the leaky modes of using an alternative damping mechanism are also
considered.

4.1  Surface  polaritons  with  Bloch-Bloembergen  damping. 

A more realistic description of surface polaritons is obtained by including a damping
mechanism in the equations of motion. In general, damping affects the degree of localization
of the wave to the surface and causes the wave to lose energy as it propagates. Although
damping and its effects on surface polaritons have been studied for plasmon polaritons and
magnetoelastic polaritons on ferromagnets, antiferromagnetic surface polariton studies have
only considered the idealized case of zero damping. In this section damping is included in
the description and the resulting effects on the surface polariton's dispersion curve, path
length and penetration depth are examined.

In addition to modifying the properties of the "true" antiferromagnetic surface
polaritons, the inclusion of damping shows the existence of new surface resonances, or
"leaky" surface modes. These are very different in character from the modes found without
damping.^^ The "true" surface modes and the leaky modes are investigated by numerically
examining the surface polariton dispersion relation (2.15) for the material MnF2 with the
parameters H_e = 550 kG, H_a = 7.87 kG, M = .6 kG and ε = 5.5.

As discussed in chapter 2, one obtains the dispersion relation by assuming plane
wave solutions of the form exp(i(k_x x - ωt)) exp(-αy) inside the material and
exp(i(k_x x - ωt)) exp(γy) outside the material (the geometry is the same as that used in the
previous chapters with the material in the y>0 half space). The fields inside and outside the
material are then matched according to Maxwell's boundary conditions on tangential H and
normal B. The resulting implicit expression was derived by Camley and Mills:^

$$\gamma + \frac{\mu_1\alpha + \mu_2 k_x}{\mu_1^2 - \mu_2^2} = 0 \tag{4.1}$$

k_x is again the wavevector parallel to the surface and decay in the y direction is governed by
α and γ (defined in (2.13) and (2.14)) which are given by:


(4.2a) 

and 


(4.2b) 


When there is no damping in the material, equation (4.1) can be satisfied by
appropriate choices of real frequencies and real wavevectors. In this case both α and γ are
positive and real. These modes represent excitations that are bound to the surface, with
infinite lifetimes. Also, when H_0 = 0 the modes are reciprocal in k. This means the solutions
obey ω(k) = ω(-k). In the presence of an applied field the modes are not reciprocal and the
solutions obey ω(k) ≠ ω(-k).

Bulk modes, on the other hand, exist in frequency ranges where α is pure
imaginary. One sees from (4.2) that these ranges exist for those ω, k_x such that α² < 0. In an
infinite geometry, all values of α are allowed and so the number of bulk modes is infinite in
each bulk band. Also, the bulk modes are reciprocal in k_x both with and without an applied
field.

When damping is present in the magnetic system (through τ in the susceptibilities of
(2.3) and (2.4)), the dispersion equation (4.1) no longer possesses pure real wavevector and
frequency solutions. The dispersion relation can be satisfied, however, for real frequencies
and complex wavevectors. With complex α, γ and k_x, these solutions represent dissipative
waves that have finite path lengths.

To illustrate the properties of these dissipative waves, equation (4.1) was solved
numerically for MnF2 both with and without damping. The quantities plotted are unitless,
with reduced frequency ω/Ω and reduced wavevector k_x c/Ω (the component parallel to
the surface).

In figure 4.1 the dispersion relations for bulk and surface polaritons are reproduced
for the case of H_0 = 0. The shaded areas represent the bulk modes and the dashed lines
between the two bulk bands are surface modes for the case of zero damping. Note that these
surface modes stop abruptly at the top of the lower bulk band where k_x = ω/c.
As the wavelength decreases, ω/k goes to zero and the surface polaritons asymptotically
approach the zero field magnetostatic surface wave frequency given by

$$\omega_s = \gamma\left[H_a\left(2H_e + H_a + 4\pi M\right)\right]^{1/2}$$
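To give a feel for how narrow the surface mode window is, the magnetostatic frequency can be compared with the antiferromagnetic resonance frequency for the MnF2 parameters quoted earlier. This is a rough sketch: it assumes the zero-field resonance frequency is Ω = γ[H_a(2H_e + H_a)]^(1/2), which appears to match the definition of Ω given in section 4.3, and the function names are illustrative only.

```python
import math

# MnF2 parameters quoted in the text, in kG.
H_E = 550.0   # exchange field H_e
H_A = 7.87    # anisotropy field H_a
M_S = 0.6     # sublattice magnetization M

def resonance_field(h_e: float, h_a: float) -> float:
    """Omega / gamma, assuming Omega^2 = gamma^2 H_a (2 H_e + H_a)."""
    return math.sqrt(h_a * (2.0 * h_e + h_a))

def magnetostatic_field(h_e: float, h_a: float, m: float) -> float:
    """omega_s / gamma from the magnetostatic surface wave expression above."""
    return math.sqrt(h_a * (2.0 * h_e + h_a + 4.0 * math.pi * m))

# gamma cancels in the ratio, so no value for it is needed.
ratio = magnetostatic_field(H_E, H_A, M_S) / resonance_field(H_E, H_A)
print(f"omega_s / Omega = {ratio:.4f}")
```

Under these assumptions the ratio is about 1.0034, so the zero-field surface polaritons occupy a reduced-frequency window only a few parts in a thousand wide above ω/Ω = 1.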

In figure 4.1 the surface polariton solutions to (4.1) when 1/Ωτ = .0001 are also
included. These solutions are plotted as functions of reduced frequency and the real part of
the reduced parallel wavevector. These dissipative waves lie very near the 1/Ωτ = 0 surface
polaritons in the frequency region above the lower bulk band and below the magnetostatic
frequency. Outside this region, new solutions appear and are represented by solid lines.

Above ω_s the dissipative waves exhibit a curious "backbending" property where the
group velocity changes sign and the modes curve inward in k_x toward the upper bulk band.
This back-bending effect is reminiscent of a similar behavior found for surface
plasmon-polaritons where the Fano ("true" surface) modes bend back with increasing
frequency into what are sometimes called evanescent modes.^^ The evanescent modes are
tightly bound to the surface and the real and imaginary parts of the wavevector component
normal to the surface have roughly the same magnitude. It is interesting to note that since the
point of bend-back occurs near ω_s, the corresponding k_x can be used to measure the
damping parameter. As the frequency increases above ω_s, the real parts of γ and α become
small so that the evanescent modes are less tightly bound to the surface. Near the lower
limit of the upper bulk band, the real parts of γ and α tend to zero and the evanescent modes
become ill-defined.

With damping present, the polariton mode continues into the lower bulk band below
the lower bulk band frequency limit, as seen in figure 4.1. This is a region forbidden to true
surface polaritons. In this frequency region modes can exist, with damping present, at an ω
and k_x for which a wave incident on the material from vacuum would be completely
transmitted. We thus identify the 1/Ωτ = .0002 mode as the magnetic analogue of the
Brewster mode found in dielectric materials.


Figure 4.1. Dispersion curves for antiferromagnetic polaritons, in MnF2, with no applied
field. The shaded areas are bulk bands and the dashed lines are the surface
modes when there is no damping present. The solid lines show how damping
modifies the surface mode frequencies. These solid lines are the real parts of the
complex solutions to the dispersion relation (4.1), with damping 1/Ωτ = .0002.


Typically, one thinks of the Brewster angle as the angle of incidence where there is
no reflected wave from a surface illuminated by a wave polarized with its electric field in the
plane of incidence. A similar angle occurs for a wave incident on a magnetic material and
polarized with its magnetic field in the plane of incidence. This angle (or rather the
corresponding component of the incident wavevector, k_x) can be found by setting to zero the
appropriate Fresnel relations for the amplitude of the magnetic field of the reflected wave (see
appendix A for the Fresnel relations). Doing so, one arrives at the magnetic Brewster
condition:

0Cx)Br=  «0o[liie2(lii2-^2^)-(lii^-ii2^)2]  /  (4.3) 

In figure 4.2 the ω and k_x that satisfy equation (4.3) are plotted along with the
solutions to the dispersion relation (4.1) for 1/Ωτ = .0001. The real part of the complex
wavevector solutions is plotted against the frequency both with an applied field and
without. When there is no applied field there is a very close correspondence between the two
curves. When H_0 = .3 kG, however, the two curves begin to differ for frequencies well
within the lower bulk band.

Figure 4.2. Real part of the complex dispersion solutions compared to the Brewster angle
(in terms of k_x rather than θ_0). The upper curves are for H_0 = 0 and the lower
curves are for the -k_x branch of the surface polariton dispersion with H_0 = .3 kG.
In the H_0 = .3 kG case, the curves are identical at higher frequencies.

Figure 4.3. Decay parameter |Im(ck_x/Ω)| and localization parameter Re(cα/Ω) for the +k_x
branch of the 1/Ωτ = .0002 dispersion curve of figure 4.1. Only the magnitudes
are shown. The solid line is Im(k_x) and the dashed line is Re(cα/Ω). Note the
strong localization to the surface just above ω/Ω = 1. The localization is greatest
in the surface mode region near the lower bulk band frequency limit.

Since Re(α) determines how tightly the surface wave is bound to the surface and
Im(k_x) governs the attenuation of the wave as it propagates parallel to the surface, it is
interesting to plot these quantities as functions of frequency. In figure 4.3 |Im(ck_x/Ω)| is
shown, as determined from the dispersion relation (4.1), together with the corresponding
Re(cα/Ω), for zero applied field and 1/Ωτ = .0002. The solid line is |Im(k_x)| and the dashed
line is Re(cα/Ω). Im(k_x) becomes large only in the bulk band and near ω_s. In the surface mode
region, Im(k_x) is small so the wave has a long path length. Re(α) is very large for
frequencies below ω_s and above the bulk band, thus indicating a strong localization to the
surface. The localization is strongest near the bulk band limit, ω/Ω = 1. In the bulk band,
however, Re(α) is small and so the mode in this region is weakly bound to the surface.
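The link between these plotted quantities and physical lengths can be sketched as follows. This is an illustration under stated assumptions rather than the procedure used for the figures: the path length is taken as the distance over which the field amplitude falls by 1/e along the surface, the penetration depth as the 1/e amplitude depth into the material, and both are expressed in units of c/Ω. The function names and the sample numbers are hypothetical.

```python
import math

def path_length(ck_x_over_omega: complex) -> float:
    """1/e amplitude propagation length along the surface, in units of c/Omega.

    Assumes the wave varies as exp(i k_x x), so the amplitude decays as
    exp(-|Im(k_x)| x) in the propagation direction.
    """
    decay = abs(ck_x_over_omega.imag)
    return math.inf if decay == 0.0 else 1.0 / decay

def penetration_depth(c_alpha_over_omega: complex) -> float:
    """1/e amplitude depth into the material, in units of c/Omega.

    Assumes the field varies as exp(-alpha y) inside the material.
    """
    decay = c_alpha_over_omega.real
    return math.inf if decay <= 0.0 else 1.0 / decay

# Hypothetical reduced values, chosen only to show the trend.
weakly_damped = path_length(1.5 + 0.01j)      # small Im(k_x): long path
strongly_damped = path_length(1.5 + 0.5j)     # large Im(k_x): short path
tightly_bound = penetration_depth(4.0 + 0.2j) # large Re(alpha): shallow
print(weakly_damped, strongly_damped, tightly_bound)
```

A small |Im(k_x)| therefore corresponds to a long path length, and a large Re(α) to a mode confined close to the surface, which is the reading applied to figure 4.3.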

Damping allows the surface mode to lose energy into the material. In figure 4.4, the
+k_x branch of the dispersion curve plus the decay parameters Im(k_x) and Re(α) are plotted
for 1/Ωτ = .0008. The solid line is |Im(k_x)| and the dashed line is Re(cα/Ω). H_0 is still zero.
With greater damping the surface mode is not as tightly bound to the surface as before and
penetrates further into the material. This increases the rate of energy loss into the material
and thus the surface mode has a shorter path length. In this way damping allows the modes
to "leak" energy into the interior of the material.

In an applied field, the surface modes outside the bulk bands are highly
nonreciprocal. The resonances in the bulk bands are also nonreciprocal in applied fields,
although the nonreciprocity disappears at frequencies below the lower bulk band limit. This
is seen in figure 4.5 where the real part of k_x is plotted against frequency for H_0 = .3 kG. The
dashed lines are the 1/Ωτ = 0 modes and the solid lines are the dissipative waves for
1/Ωτ = .0002. The 1/Ωτ = 0 modes coincide with the 1/Ωτ = .0002 modes except near the
magnetostatic limit frequency. Again there is a close correspondence between the 1/Ωτ = 0
modes and the modes with damping in the surface mode region between the bulk band and
the magnetostatic limit.

With the applied field, there are two regions where the real parts of γ and α become
very small and the leaky modes become ill-defined. One region is above ω/Ω = 1.0075, near
the upper limit of the middle bulk band. The leaky modes do not seem to exist for
frequencies above 1.0075. The second region occurs for the +k_x branch near ω/Ω = .998.
Here γ becomes very small and the Brewster mode is ill-defined for frequencies between .998
and the top of the lower bulk band.


Figure 4.4. Decay parameter |Im(ck_x/Ω)| and localization parameter Re(cα/Ω) for the +k_x
branch of dispersion curves with 1/Ωτ = .0008. The solid line is Im(k_x) and the
dashed line is Re(cα/Ω). There is no applied field and only the magnitudes are
shown. Here the increased damping decreases the localization of the modes to
the surface at all frequencies.

Figure 4.5. Dispersion curves for antiferromagnetic polaritons, in MnF2, with an applied
field of .3 kG. The shaded areas are bulk bands and the dashed lines are the
surface modes when there is no damping present in the material. The solid lines
show how damping modifies the surface mode frequencies. These solid lines
are the real parts of the complex solutions to the dispersion relation (4.1) with
damping 1/Ωτ = .0002. Note that the modes with damping present are reciprocal
at low frequencies and become highly nonreciprocal at higher frequencies.


In figure 4.6, frequency versus |Im(k_x)| and Re(α) are again shown, this time for
the case of an applied field of 0.3 kG. The solid lines are |Im(k_x)| and the dashed lines are
Re(cα/Ω). Values for the -k_x branch are shown in the top plot and values for the +k_x
branch are shown in the lower plot. Although signs are not shown, Im(k_x) is negative for
the -k_x branch so that the wave attenuates in the -x direction. Again there is strong
localization to the surface in regions below ω_s and above the bulk band for both branches.
Also note the strong localization of the surface modes where they begin near the light line
(see figure 4.5) and the lack of localization as they enter the upper bulk band. Both branches
are strongly bound to the surface near the lower bulk band limit and near ω_s. In between,
however, the localization decreases. In both the +k_x and -k_x branches, the path length is
large except in the lower bulk band and above ω_s. The path length is also large above the
middle bulk band.

The power flows in surface excitations with damping present help clarify the nature
of the leaky modes. Using the results of Camley and Mills^, it is a simple matter to calculate
the amplitudes of the electric and magnetic fields of the surface polaritons inside and outside
the material. From these one can calculate the Poynting vector inside and outside the
material. The total power flow parallel to the surface in the material is found by integrating
the material Poynting vector over a rectangular surface of width L_z from y = 0 to y = ∞.
Likewise, the total power flow parallel to the surface in the vacuum is found by integrating
the vacuum Poynting vector over a rectangular surface of width L_z from y = 0 to y = -∞.
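The factors of 2Re(α) and 2Re(γ) that appear in the power flow expressions follow directly from this integration. As a sketch, assume only that the fields vary as exp(-αy) in the material and exp(γy) in the vacuum, so that the time-averaged Poynting vector component along x decays at twice the real part of the decay constant. The symbols S_x^{mat}(0) and S_x^{vac}(0), the surface values of that component on each side, and the width L_z are notation introduced here for illustration:

```latex
\frac{1}{L_z}\int_0^{L_z}\! dz \int_0^{\infty}\! dy \; S_x^{mat}(0)\, e^{-2\,\mathrm{Re}(\alpha)\,y}
   = \frac{S_x^{mat}(0)}{2\,\mathrm{Re}(\alpha)},
\qquad
\frac{1}{L_z}\int_0^{L_z}\! dz \int_{-\infty}^{0}\! dy \; S_x^{vac}(0)\, e^{2\,\mathrm{Re}(\gamma)\,y}
   = \frac{S_x^{vac}(0)}{2\,\mathrm{Re}(\gamma)} .
```

Both integrals converge only when Re(α) and Re(γ) are positive, which is why the power flows become ill-defined in the frequency regions where these real parts tend to zero.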


Figure 4.6. Decay parameter |Im(ck_x/Ω)| and localization parameter Re(cα/Ω) for the +k_x
and -k_x branches of the 1/Ωτ = .0002 dispersion curves of figure 4.5. The solid
lines are |Im(k_x)| and the dashed lines are Re(cα/Ω). (The remainder of this
caption is illegible in the source.)


Defining  power  flows  per  unit  length  as  U^/L^  for  the  power  flow  in  the  material 
and  for  the  power  flow  in  the  vacuum,  one  obtains  the  expressions: 

U*--‘  ■Re{— 

2Re(Y)  47tco^ 


(4.5) 


and 


2Re(a)  4jtfi)^ 


(4.6) 

Here  is  the  amplitude  of  the  polaritons'  electric  field  at  the  surface. 

The parallel power flows inside and outside the material are plotted in figure 4.7 for the
frequency and complex wavevector solutions of (4.1) for +k_x with no applied field and
1/Ωτ = .0002. In the bulk band (ω/Ω < 1) the power flows inside and outside the material are
both in the direction of propagation. At lower frequencies, most of the energy is carried by
fields in the material. Near the bulk band limit, however, most of the energy is carried in the
fields outside the material, and at the antiferromagnetic resonance frequency, ω/Ω = 1, all of
the electromagnetic energy is carried in the vacuum.

In the surface mode region above the bulk band and below ω_s, the energy in the
material flows opposite to the direction of propagation. This is typical of surface plasmon
and magnon polaritons. Note that in the surface wave region, most of the energy is carried
by the fields in vacuum and so the net energy flow is in the direction of propagation except
near the magnetostatic frequency. At ω_s the electromagnetic energy carried by the fields in
vacuum is nearly equal to the energy carried in the opposite direction by the fields in the
material and the net energy flow is very small. Above ω_s, the energy flow is in the direction
of propagation both in the material and outside.


Figure 4.7. Parallel power flows inside and outside the material for the +k_x
branch of 1/Ωτ = .0002 modes in zero applied field. Note that in the surface
mode region between ω_s and the lower bulk band limit (ω/Ω = 1) the energy flow
inside the material is opposite the direction of propagation.


The power flows inside and outside the material are plotted in figure 4.8 for the frequency
and complex wavevector solutions of (4.1) with an applied field H_0 = .3 kG. The damping is
1/Ωτ = .0002 in the upper plot and .0005 in the lower plot. Here only the -k_x solutions are
shown. For both dampings energy flow in the material is opposite the direction of propagation
for frequencies near ω_s. Near the bulk band, however, the energy flow in the material is in the
direction of propagation. Furthermore, comparison of the upper and lower plots shows that
increasing the damping decreases the frequency range for energy flow in the material that
opposes the direction of propagation.

The direction of energy flow in the material is governed by the sign and magnitude
of the magnetic susceptibilities. A negative μ1 usually leads to energy flows opposite the
direction of propagation. This occurs in the surface mode region when there is no applied
field. Also with no applied field, μ1 is positive in the bulk band and the energy flow is in the
direction of propagation. In an applied field, however, the non-vanishing μ2 can lead to
positive energy flows in the surface region and negative energy flows in the bulk, near the
bulk band limits. This is because near the limit frequencies α, μ1 and μ2 are
extremely sensitive to frequency. Consequently, near Ω the direction of the power flow is
very sensitive to the introduction of an applied field. In addition, damping also plays an
important role near the bulk band limits, as seen by comparing the two plots of figure 4.8.


Figure 4.8. Parallel power flows inside and outside the material for the -k_x
branches of 1/Ωτ = .0002 modes with a .3 kG applied field. The upper plot is for
the case of 1/Ωτ = .0002 and the lower plot is for the case 1/Ωτ = .0005. Note
that the direction of power flow inside the material is strongly influenced
through the nonzero μ2.


4.3 Surface polaritons with Landau-Lifshitz damping.

The form of the damping terms appearing in the susceptibilities originates from
adding Bloch-Bloembergen terms to the equations of motion. These have the simple form


-MxM/t 


(4.7) 


and so represent a relaxation of the magnetizations to their equilibrium values. As such,
these terms can only be valid at high frequencies where the dissipative effects are averaged
over several precessions of the magnetizations.

Considering the importance of damping on the nature of the leaky modes, it is of
interest to examine the leaky modes when the damping mechanisms involve much shorter
relaxation times. This can be accomplished by choosing damping terms of the
Landau-Lifshitz form: ^4


-AMxM 


(4.8) 

Λ is a parameter which determines the magnitude of the damping. These terms
represent a torque in the direction of the instantaneous magnetization and thus represent
relaxation to the instantaneous value of the internal field. These terms are a
phenomenological description which is used extensively to explain certain phenomena in
ferromagnetic resonance. A derivation of the terms in (4.8) for antiferromagnets, done
originally for Landau damping terms in ferromagnetic resonance,^^ can be obtained from the
free  energy  for  the  magnetic  system.  This  calculation  is  outlined  in  appendix  in  for  the 
corresponding  antiferromagnetic  terms. 


Bloch's equations of motion for the A sublattice magnetization, with Landau
damping included, are


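$$\dot{\mathbf{M}}_a = \gamma\,\mathbf{M}_a \times \mathbf{H}_a - \frac{\Lambda}{M^2}\left[\mathbf{M}_a \times \left(\mathbf{M}_a \times \mathbf{H}_a\right)\right]$$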


(4.9) 

where the effective fields are defined by equations (2.3), γ is the gyromagnetic ratio and
M is the saturation magnetization. A similar expression holds for the B sublattice
magnetization M_b. These constitute a set of six coupled equations which are to be put in the
form


M 


(4.10) 

A time dependence e^{-iωt} is assumed and the equations are linearized in the m_x, m_y, h_x, h_y
variables. This means the susceptibilities are valid only for small angles of precession for the
magnetizations  such  that  M2=0. 
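The character of the Landau-Lifshitz term in equation (4.9) can be seen by integrating the equation of motion for a single magnetization in a fixed field. This is a minimal sketch and not the linearized two-sublattice calculation of the text: it assumes a precession term of the form γ(M × H) (the sign convention is an assumption), uses dimensionless units, and the names `landau_lifshitz_rhs`, `integrate` and `lam` are illustrative.

```python
import numpy as np

def landau_lifshitz_rhs(m, h, gamma, lam):
    """dM/dt = gamma (M x H) - (lam / |M|^2) M x (M x H)."""
    m_cross_h = np.cross(m, h)
    return gamma * m_cross_h - lam * np.cross(m, m_cross_h) / np.dot(m, m)

def integrate(m0, h, gamma, lam, dt, steps):
    """Fixed-step fourth-order Runge-Kutta integration of the equation above."""
    m = np.array(m0, dtype=float)
    for _ in range(steps):
        k1 = landau_lifshitz_rhs(m, h, gamma, lam)
        k2 = landau_lifshitz_rhs(m + 0.5 * dt * k1, h, gamma, lam)
        k3 = landau_lifshitz_rhs(m + 0.5 * dt * k2, h, gamma, lam)
        k4 = landau_lifshitz_rhs(m + dt * k3, h, gamma, lam)
        m = m + dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return m

# Unit magnetization starting perpendicular to a unit field along z.
field = np.array([0.0, 0.0, 1.0])
m_final = integrate([1.0, 0.0, 0.0], field, gamma=1.0, lam=0.1, dt=0.01, steps=2000)
print(m_final, np.linalg.norm(m_final))
```

The damping torque is perpendicular to M, so the length of the magnetization is preserved while it spirals toward the field direction. For this unit-length case the component along the field follows tanh(Λt), which the integration reproduces.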

After  straightforward  algebra,  the  susceptibility  tensor  is  found  to  have  the  same 
form  as  before  (equation  (2.3))  but  the  components  are  now  given  by 


2M'l(a(n2-Q2-(o2>fiAC0(Q)2-Q2+£j2-2fl2)] 

”  (iy‘-20)2(Q2+Q2^Q2_Q2)^2iA(D(©2+rf_Q2)(Q2+ji2yQ^ 


(4.11) 




and 


4iY(efl-(iAa)-Q ) 

^*^~a(^-2(d^Q2^^2)4.(Q2^2j2+2iAaXa)Z+Q2-Ci2)(Q24i)2)/fl 
0  0  0  **  “ 


(4.12) 

The various quantities are Ω_0 = γH_0, Ω_a = γH_a and Ω² = γ²H_a(H_a + 2H_e), where H_a is the
anisotropy field, H_e is the mean exchange field, Ω is the antiferromagnetic resonance frequency
and γ is the gyromagnetic ratio.

The remaining components are given by the simple relations χ_yy = χ_xx and
χ_yx = -χ_xy. Note that with Λ = 0, these reduce to the expressions given by equations (2.4)
and (2.5) for τ → ∞.

The fundamental difference between these susceptibilities and the Bloch damped
susceptibilities is the dependence of the damping on frequency. Since the magnitude of the
damping effects increases with frequency in this case, the surface leaky modes should be
most affected at higher frequencies. In figure 4.9 the real and imaginary parts of k_x and the
real part of α are plotted against frequency for the case Λ = .002 and H_0 = .3 kG. The -k_x
branch of the dispersion curve is shown.

The  general  characteristics  of  the  leaky  modes  are  the  same  as  in  the  Bloch  damping 
case.  In  the  Brewster  mode  region  at  lower  frequencies,  the  leaky  wave  is  strongly 
attenuated  in  the  propagation  direction  and  weakly  bound  to  the  surface.  In  the  surface 
polariton  region,  the  wave  has  a  long  path  length  and  is  strongly  bound  to  the  surface.  In  the 
evanescent  region,  where  the  new  Landau  damping  effects  should  be  most  pronounced,  there 
is  again  a  weak  localization  to  the  surface  and  a  shorter  path  length. 


An interesting consequence of the Landau form of the damping is in the stability of
the leaky modes in regions where the localization of the leaky modes to the surface is weak.
As discussed before, when the real parts of either γ or α become vanishingly small the leaky
mode wave does not represent a stable solution. These unstable frequency regions were
found to be near the top of the lower and middle bulk polariton bands. When the Landau
form of the damping is assumed, these frequency regions become more stable.


Figure 4.9. Dispersion curves and decay parameters for surface excitations on MnF2 with
no applied field. The damping terms are of the Landau form with Λ = .002.


CHAPTER 5


ELECTROMAGNETIC  GREEN'S  FUNCTIONS 

To date, theoretical discussions of the properties of antiferromagnetic polaritons
have focused on dispersion relations calculated from the electromagnetic wave equation for
anisotropic magnetic media. Dispersion curves describe the possible allowed modes of the
magnetic system, but do not describe how these modes may be excited. It is often useful,
however, to know the response of the magnetic system to some external probe, such as the
response of an antiferromagnet to an incident light wave. Green's functions are a primary
tool for dealing with these types of questions.^^

The objective of this chapter is to calculate the electromagnetic Green's functions for
a semi-infinite antiferromagnet. As usual, Green's functions can be used to obtain a variety
of information about the allowed modes of a system. Dispersion relations, spectral densities
and the effective lifetime or attenuation length can all be obtained from the Green's
functions.^ In addition, Green's functions can be used in a variety of applications.
Examples include the calculation of Raman and Brillouin scattering spectra, nonlinear
interactions of surface polaritons, and thermodynamic properties.^^^^ In the next chapter
these Green's functions are used to study the reflectivity from an antiferromagnet with a
slightly rough surface.

This chapter begins with the calculation of the Green's functions appropriate to a
semi-infinite antiferromagnet. To simplify the algebra, only the case of polarization parallel
to the easy axis is considered. Finally, the surface polariton modes discussed in previous
chapters are shown to also be represented by the Green's functions.


5.1  Green's  functions. 

The geometry is the same as in the previous chapters. The material is in the y>0
half-space with the surface at y=0. The antiferromagnet is uniaxial and the magnetizations of
the two sublattices lie along the ±z axis. The applied field also lies along z. Inside the
material, the susceptibilities are given by equations (2.3) and (2.9). Outside the material,
where y<0, the susceptibilities are uniform and describe a vacuum:


(5.1) 


(5.2) 


The magnetic wave equation for the propagation of transverse electromagnetic waves in
media described by the susceptibilities (2.3) and (2.8) under the influence of some driving
field F has the form


The prime on the sum means that $m \neq j,k,l$. Also, the shorthand notation $\omega_0 = \omega/c$ is used.
Since the dielectric tensor is diagonal in this problem, $\epsilon_{kk} = \epsilon_k$. Note that both the magnetic
fields H and the driving fields F are assumed to have time dependence $\exp(-i\omega t)$.

The  Green's  functions  satisfy  the  associated  wave  equation  given  by 


( 

f 

\ 

\ 

1 

-  8 

le- 

jl' 

-"o^y 

1  «  dx.  dx^ 

Jk 

[  1 

dxi  1 

1  ^  * 

I 

'o  1  8j„M=4K5,^5ix-x') 


(5.4) 




The solutions to (5.3) are then given by


(5.5) 

The volume of integration is over all of space.

Equation  (5.4)  must  be  translationally  invariant  in  both  x  and  z.  This  invariance  can 
be  exploited  through  the  use  of  Fourier  expansions: 


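$$g_{ij}(\mathbf{x},\mathbf{x}') = \int \frac{d^2k_\parallel}{4\pi^2}\, e^{i\mathbf{k}_\parallel\cdot(\mathbf{x}_\parallel-\mathbf{x}_\parallel')}\, g_{ij}(\mathbf{k}_\parallel; y, y') \tag{5.6}$$

and

$$\delta(\mathbf{x}-\mathbf{x}') = \delta(y-y') \int \frac{d^2k_\parallel}{4\pi^2}\, e^{i\mathbf{k}_\parallel\cdot(\mathbf{x}_\parallel-\mathbf{x}_\parallel')} \tag{5.7}$$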

Here we've made the definitions $\mathbf{k}_\parallel = \hat{x}k_x + \hat{z}k_z$ for the wavevector parallel to the plane
y = 0 and $\mathbf{x}_\parallel = \hat{x}x + \hat{z}z$ for the position vector in that plane.

Applying  these  transformations  to  the  Green's  function  equation  (5.4),  and  writing 
the result in explicit matrix form gives


■fd-k]  fk^D) 


Akl^k]) 


k  k 

X  z 


■ik^D 


-ik  D 

z 


d  -k\ 


<r> 

g  =-4;rIa:y-y') 


(5.8) 




with  the  definitions 


$$D = \frac{\partial}{\partial y} \tag{5.9}$$

(5.10) 

(5.11) 

The electric and magnetic fields must satisfy the usual homogeneous electromagnetic
boundary conditions at the surface y = 0 (assuming that the sources of the driving fields are
not located at the surface y = 0 so that the boundary conditions are homogeneous). The
condition that the normal component of B be continuous across y = 0 is first applied. Using
the H field given by equation (5.5) to calculate B, applying the boundary condition, and
equating coefficients of the driving-field components results in the condition


’  8ym  -0+  3 


y  »0- 


(5.12) 


Similarly,  continuity  of  tangential  H  results  in 


lyaO.). 


xm  Jy-0- 


(5.13) 


y 


y 


-0- 


(5.14) 




Finally,  relating  E  to  H  via  Maxwell's  relations,  continuity  of  tangential  E  requires  that 

Szm-^KSymX^  (5.15) 

^ym  ^  SjKi  ly  >0+  ~  ^  t*’  X Sytn  +  *  ^  Sxm  ly  »0-  (5.16) 

A linear combination of equations (5.13) and (5.14) gives an equation identical to that which
arises  by  requiring  continuity  of  normal  D  across  y  =0,  thus  providing  no  new  information. 

The complexity of the algebra makes an analytical solution for the case of general
propagation directions difficult. The main features of interest, however, can be obtained by
considering the simpler case where the electric field is constrained to lie along the z axis and
the incident electromagnetic wave propagates in the xy plane. This is called the Voigt
geometry and is a common experimental setup. In addition, the nonreciprocity is often a
maximum in this geometry. The Voigt geometry uncouples the $H_z$ field from the $H_x$ and $H_y$
fields, as can be seen by examining the equation of motion matrix (5.8). This greatly
facilitates the separation and solution of the differential equations of motion and the
application of the boundary conditions.

Note  that  in  the  special  case  of  the  electric  field  along  z,  g  has  no  z  dependence  and 
thus  its  Fourier  expansion  is  in  one  dimension  only.  Accordingly,  the  wave  equation  (5.4)  is 
transformed  with  the  one  dimensional  counterparts  of  the  transforms  (5.6)  and  (5.7). 

The  Green's  functions  differ  depending  on  whether  the  source  point  y’  lies  in  the 
material  or  outside  in  vacuum.  Beginning  with  the  case  where  y'  lies  outside  the  material, 
the  matrix  equation  for  g  becomes 

,  g(i- y.y' )  =  0 


(5.17) 




for y > 0 and y' < 0 (source outside the material and observation point inside the material). In
this polarization there is no z component of the H field, so g is now a 2x2 matrix. The
homogeneous set of equations (5.17) can be shown to have the solutions


$$g_{ij}^{>} = C_{ij}^{>}\, e^{-\alpha y} \tag{5.18}$$

where α represents the decay constant in the y direction and is given by


a-..  / 

X  0  2 


(5.19) 


Note that the positive root is specified to ensure proper decay in the y direction. The
coefficients $C_{ij}^{>}$ in (5.18) will be determined later through the boundary conditions.

For  fields  outside  the  material,  the  appropriate  Green's  functions  satisfy  the 
inhomogeneous  equations 

$$\begin{pmatrix} D^2+\omega_0^2 & -ik_xD \\ -ik_xD & \omega_0^2-k_x^2 \end{pmatrix} g(k_x; y, y') = -4\pi I\,\delta(y-y') \tag{5.20}$$

for y < 0 and y' < 0.

The solutions to this set of equations will be a linear combination of a particular
solution, which solves the inhomogeneous equation, and a solution to the associated
homogeneous equation. Thus

$$g_{ij} = g_{ij}^{h} + g_{ij}^{p} \tag{5.21}$$

where $g_{ij}^{h}$ is the homogeneous solution and $g_{ij}^{p}$ is the particular solution.
The  particular  solutions  obey 


0 

(5.22) 

0 

(5.23) 

Here γ is the decay constant in the y direction outside the material. It is given by

$$\gamma = \sqrt{k_x^2 - \omega_0^2} \tag{5.24}$$

Again,  the  positive  root  is  chosen  so  that  the  wave  decays  away  from  the  surface. 

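The solutions to the equations (5.22) and (5.23) must vanish at y = ±∞ in order to
represent physical waves. These solutions are then given by the relations

$$(D^2-\alpha^2)\left(-\frac{e^{-\alpha|y-y'|}}{2\alpha}\right) = \delta(y-y') \tag{5.25}$$

and

$$(D^2-\alpha^2)\,\tfrac{1}{2}\,\mathrm{sgn}(y-y')\,e^{-\alpha|y-y'|} = D\,\delta(y-y') \tag{5.26}$$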




An important point to notice is that as long as α is real, the solutions vanish at y = ±∞
according to $e^{-\alpha|y-y'|}$. Thus (5.25) and (5.26) apply even if α is complex. If α becomes
imaginary, however, then in order to represent an outgoing wave at both +∞ and −∞ the
solutions must be of the form $e^{ib|y-y'|}$. This requirement leads to relations corresponding to
(5.25) and (5.26) for the case α = ib:

$$(D^2+b^2)\,\frac{e^{ib|y-y'|}}{2ib} = \delta(y-y') \tag{5.27}$$

and

$$(D^2+b^2)\,\tfrac{1}{2}\,\mathrm{sgn}(y-y')\,e^{ib|y-y'|} = D\,\delta(y-y') \tag{5.28}$$

Comparison of (5.27) and (5.28) to (5.25) and (5.26) shows that solutions for the case of
imaginary α are obtained from the solutions for real α by letting α go to −iα in the real-α
solutions. The reverse transformation is obtained by letting α go to iα in the solutions for
imaginary α to get the solutions for real α. Rather than write two sets of Green's functions for
the cases of real and imaginary γ and α, only the form of the solutions for both γ and α real is
presented, with the stipulation that the solutions for imaginary γ and α are obtained by letting
γ go to −iγ and α go to −iα. Note that since 0 is a branch point for both α and γ, this
prescription is equivalent to choosing a sign convention across a branch cut.
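
The branch choice described above can be illustrated numerically. The following is a minimal sketch only, assuming the vacuum decay constant has the form $\gamma = (k_x^2 - \omega_0^2)^{1/2}$ and a time dependence $\exp(-i\omega t)$; the function name `decay_constant` is hypothetical and does not appear in the original text. The root is taken with a non-negative real part (decay away from the surface), and a purely imaginary root is given a negative imaginary part so that $e^{-\gamma|y-y'|}$ is an outgoing wave.

```python
import cmath

def decay_constant(kx, omega0):
    """Return sqrt(kx^2 - omega0^2) on the branch with Re >= 0.

    When the root is purely imaginary, the negative imaginary part is
    selected, so exp(-root*|y|) behaves as exp(+i*b*|y|), an outgoing
    wave for an assumed exp(-i*omega*t) time dependence.
    """
    kx = complex(kx)
    omega0 = complex(omega0)
    root = cmath.sqrt(kx * kx - omega0 * omega0)  # principal root, Re >= 0
    if root.real < 0 or (root.real == 0 and root.imag > 0):
        root = -root
    return root

# Surface-like case (kx > omega0): real, positive decay constant.
surface_like = decay_constant(2.0, 1.0)
# Bulk-like case (kx < omega0): purely imaginary, negative imaginary part.
bulk_like = decay_constant(0.5, 1.0)
```

With damping, `omega0` may be passed as a complex number; the sketch then simply returns the root with non-negative real part, and the sign of the imaginary part follows from the sign of the damping term.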

When damping is present, as it will be in the next section, α and γ become complex.
In order that the Green's functions represent both an outgoing wave and a vanishing
exponential at ±∞, the imaginary parts of α and γ must be negative for the forms given by
(5.25) and (5.26). In order for the forms in (5.27) and (5.28) to satisfy the same boundary
conditions, however, α and γ must have positive imaginary parts. The transformation rule to
get from the solutions of (5.25) and (5.26) to the solutions of (5.27) and (5.28) is thus
generalized by letting α go to −iα* and γ go to −iγ*, with the convention that the real parts of
α and γ are positive and the imaginary parts are negative.

Whether the imaginary and real parts of either α or γ have the same or different
signs is determined by the sign on τ in the equations of motion and the boundary conditions
at ∞. As a final comment, the form of the Green's functions when γ and α are complex is
chosen according to whether the frequency and wavevector are in the bulk or surface
regions. As discussed in Chapter 2, the surface region for waves outside the material is
defined as the region in the dispersion curves where $|\gamma| > \omega_0$, and the surface region for waves
in the material is where $|\alpha| > |k_x|$. The bulk region for waves outside the material is where
$|\gamma| < \omega_0$, and the bulk region for waves in the material is where $|\alpha| < |k_x|$.

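The particular solutions to equations (5.22) and (5.23) are

$$g_{xx}^{p} = -\frac{2\pi\gamma}{\omega_0^2}\, e^{-\gamma|y-y'|} \tag{5.29}$$

$$g_{xy}^{p} = -\frac{2\pi i k_x}{\omega_0^2}\,\mathrm{sgn}(y-y')\, e^{-\gamma|y-y'|} \tag{5.30}$$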


The homogeneous solutions to the equation set (5.20) are given by

$$g_{ij}^{<} = C_{ij}^{<}\, e^{\gamma y} \tag{5.31}$$

The next task is to derive and apply boundary conditions to determine the
coefficients $C_{ij}$. First, the homogeneous equations of motion (5.17) are examined. These
provide a relationship between $g_{xx}$ and $g_{yx}$ valid as y approaches 0 from the positive side:

[  +k^D)g„+(kJ'-a)Je  n  )g  1  =0 

■'y=Of 


(5.32) 




Similarly, the equations of motion in (5.20) provide a relationship valid when y approaches
0 from the negative side:


*•  •‘y  =0- 

(5.33) 

Together with the continuity condition on normal B from equation (5.12), these
relations result in a boundary condition on $g_{xx}$:


=0+ 


(5.34) 

The continuity condition of tangential H given in (5.13) provides a second boundary
condition on $g_{xx}$.

For y > 0, the solution $g_{xx}$ is given by equation (5.18). For y < 0, $g_{xx}$ also includes
the particular solution (5.29) in addition to the homogeneous part (5.31). Application of the
boundary conditions (5.13) and (5.34) determines the coefficients $C_{xx}$ of the
homogeneous parts of the solutions and results in the following expressions for the solutions
in each region. For y' < 0 and y > 0,


0 


(5.35) 


and  for  y'  <0  and  y  <0, 




0 


(5.36) 


The quantities A and B are defined as

$$A = \omega_0^2\epsilon_2\mu_1 - k_x^2 \tag{5.37}$$

$$B = \gamma(\mu_1\alpha - \mu_2 k_x) \tag{5.38}$$

The poles of the Green's functions occur when A − B = 0. This condition can be shown to
give the antiferromagnetic surface polariton dispersion relation derived by Camley and
Mills.3

The relationships between $g_{xx}$ and $g_{yx}$ given in the equations of motion (5.17) and
(5.20) can be used to easily determine $g_{yx}$ from equations (5.35) and (5.36). The result for
y' < 0 and y > 0 is


4jtiy 


0 


/ 

^ . 


(D^e  u  -  ak 
0  2  2  ^ 

A-B 


-ay 


while  for  y'  <  0  and  y  <  0, 


(5.39) 


Asgn(y-y')+5 

A-B 


sgn(y+y')  e 


-y\y  -ty'  I 


+  sgn(y-y')c 


■y\y-y’\ 


) 


(5.40) 




The remaining Green's functions are found in much the same manner. The
particular solution for $g_{xy}$ is given by (5.30), but boundary conditions are still required to
find $C_{xy}^{>}$ and $C_{xy}^{<}$ for the homogeneous solutions given by (5.18) and (5.31). Note that in
this problem the source terms do not lie at y' = 0, so only the homogeneous counterparts of
the inhomogeneous equations belonging to (5.20) are used to uncouple the boundary
conditions for $g_{xy}$. In particular, from (5.20) a relationship valid at y = 0− is obtained:

+  1  =0 
-‘y  =0- 

(5.41) 


The  corresponding  relation  in  (5.17)  is: 

[i 0 ) «.,  +  (ffl  V,  - )  *„  1  =0 

•*y  =0+ 

(5.42) 

As  before,  this  is  combined  with  the  continuity  equation  on  normal  B  (5.12)  to  derive  the 
boundary  condition 

■*y  =  0+  -'y  =0- 

(5.43) 


Again,  the  continuity  of  tangential  H  (5.13)  provides  a  second  boundary  condition. 

Application of these conditions to the solutions given by (5.18) for y > 0, and the
sum of (5.30) and (5.31) for y < 0, determines $C_{xy}$. The resulting expression for $g_{xy}$ for y' < 0
and y > 0 is




and  the  expression 


(5.44) 


A+&gti(y-y')B 

A-B 


(5.45) 


for y' < 0 and y < 0.

Finally, the equations of motion in (5.17) and (5.20) are used to determine $g_{yy}$ from
the $g_{xy}$ of equations (5.44) and (5.45). The result for y' < 0 and y > 0 is


47tk 

a 

^yy  ? 
(at 


(oh^ 


.yy\-«y 


A-B 


(5.46) 


For  y'  <  0  and  y  <  0  the  inhomogeneous  term  from  (5.20)  must  be  included: 


2)rt/  (  A  »gn(y-y'iB 
0 


g  r(y+/) 


1  y2 


S(y-y') 


(5.47) 

The above Green's functions reduce to those of Mills and Maradudin^^ in the nonmagnetic
limit $\mu_1 = 1$, $\mu_2 = 0$, and $\epsilon_1 = \epsilon_2$.




5.2  Surface  polaritons. 

When the $g_{ij}$ are considered as functions of $k_x$ for an ω fixed in the surface polariton
region, the $g_{ij}$ have sharp peaks at those $k_x$ that satisfy the surface polariton relation. To
illustrate this, in figure 5.1 the imaginary part of $g_{yy}$ (evaluated at the surface just inside the
material) is plotted vs the unitless wavevector $ck_x/\Omega$ for ω/Ω = 1.002 and ω/Ω = 0.9989. For
simplicity, there is no applied field, and a damping of 1/Ωτ = 0.0002 is included so that the
peaks have finite width.

The ω/Ω = 1.002 peak occurs at a $k_x$ where a surface polariton exists. At this frequency
and wavevector, the excitation decays exponentially away from the surface with a negligibly
small radiative part. The curve for ω/Ω = 0.9989, however, represents a different excitation in
a frequency region forbidden to surface polaritons. This excitation is unlike the "true"
surface polaritons in that it has a significantly large radiative part in directions normal to the
surface.

As discussed in the derivation of the Green's functions, care must be taken in
choosing the appropriate signs on the imaginary parts of γ and α in order to satisfy the
outgoing wave boundary condition at infinity. When these conditions are strictly obeyed, the
surface polariton peak at ω/Ω = 1.002 exists but the peak at ω/Ω = 0.9989 does not. The peak at
ω/Ω = 0.9989 was produced by relaxing the boundary conditions on exponentially decaying,
outgoing waves at −∞.

The dependence on "non-physical" boundary conditions at ∞ is a characteristic of
the leaky modes and has received a great deal of attention in the literature.^^*^^ Leaky
modes are not eigenmodes of the system, and their existence is strongly dependent on the
geometry of the sample and the driving electromagnetic fields. Leaky modes are typically
represented as exponentially increasing into the material, with the understanding that
absorption by the material prevents unbounded growth. The use of the Green's functions to
show their existence is only an approximation and is justified mainly by the fact that leaky
modes are observed in experiment and account for significant losses from the radiation
fields.^ ^

Figure 5.1. Imaginary parts of $g_{yy}$ at ω/Ω = 1.002 and ω/Ω = 0.9989 as functions of $ck_x/\Omega$.
There is no applied field and 1/ωτ = 0.0002. The ω/Ω = 1.002 peak occurs at the
$+k_x$ surface polariton of figure 4.3. Note the existence of an excitation in the
ω/Ω = 0.9989 curve that lies in a frequency region forbidden to surface polaritons.

An approximation that ignores the boundary conditions at ∞ allows the Green's
functions with damping to show excitations at real frequencies and real wavevectors that
represent magnetic Brewster modes. A dispersion relation can be obtained from the Green's
functions in the following way. Fix a value of ω and plot $g(k_x,\omega)$ as a function of $k_x$. The
value of $k_x$ for which there is a peak and the initial value of ω provide one point of a
dispersion curve. By repeating the process for different ω values, one can trace out the entire
dispersion curve.
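
The peak-tracing procedure just described can be sketched in a few lines. This is an illustration only: the function `toy_response` is a hypothetical Lorentzian stand-in for the imaginary part of a Green's function, with an assumed resonance at $k = 2\omega$, and is not one of the Green's functions derived in this chapter. Only the procedure (fix ω, scan $k_x$, record the peak position) follows the text.

```python
def toy_response(k, omega, damping=0.01):
    """Hypothetical stand-in for Im g(k_x, omega): a Lorentzian in k
    peaked at the assumed resonance k = 2*omega."""
    k_res = 2.0 * omega
    return damping / ((k - k_res) ** 2 + damping ** 2)

def trace_dispersion(response, omegas, k_grid):
    """For each fixed omega, scan k over k_grid and record the k at which
    the response is largest. Returns a list of (k_peak, omega) points."""
    points = []
    for omega in omegas:
        k_peak = max(k_grid, key=lambda k: response(k, omega))
        points.append((k_peak, omega))
    return points

# Scan k from 0 to 5 in steps of 0.001 for three frequencies.
k_grid = [i * 0.001 for i in range(5001)]
curve = trace_dispersion(toy_response, [0.5, 1.0, 1.5], k_grid)
```

Each entry of `curve` is one point of the traced dispersion curve; the resolution of the result is limited by the spacing of `k_grid`.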

Figure 5.2 presents the comparison between the Green's function results and the
results from equation (4.1) with zero applied field. Here 1/Ωτ = 0.0001, and only the region
near the resonance inside the bulk band is examined. The two curves represent the results
from (4.1) for ω vs. Re($k_x$) and the Green's function results for ω vs. $k_x$. While the Green's
function and the Re($k_x$) curves are in poor agreement at smaller frequencies, they approach
one another as the frequency increases. To understand this, it is useful to examine the
imaginary part of $k_x$ as a function of frequency. It turns out that Im($k_x$) is large at the
lower frequencies, while at higher frequencies Im($k_x$) is much smaller. Since the peaks in
the Green's function occur at real wavevectors, good agreement with the dispersion
relation results is not expected when $k_x$ has a large imaginary part.

Even apart from the boundary condition approximation, the Green's functions and the
dispersion curves obtained by solving (4.1) are fundamentally different things. Equation
(4.1) describes possible excitations that may exist on the material, and the Green's functions
describe the propagation of waves, originating from some source, through the material. In
the dispersion description, the wave attenuates according to the imaginary part of the
wavevector solution. The response function, in contrast, has peaks at real frequencies and
wavevectors, and it is the width of the peaks which can be related to the attenuation of the
wave.




Figure 5.2. The surface resonance poles of $g_{yy}$ as functions of real ω and real $k_x$ with
damping 1/ωτ = 0.0001. There is no applied field. Also plotted is the dispersion
curve of the surface resonance obtained by solving the damped dispersion
relation (3.1) with real frequency and complex $k_x$. The $g_{yy}$ poles approach the
dispersion curve at higher frequencies where Im($ck_x/\Omega$) becomes small.


CHAPTER 6


SCATTERING  FROM  ROUGH  SURFACES 

Numerous  theoretical  methods  have  been  developed  which  attempt  to  describe  the 
effects  of  various  types  of  surface  roughness.  Scattering  due  to  random  roughness  has  been 
treated  by  perturbation  methods  in  the  limit  of  small  surface  height  fluctuations  using  a 
variety of classical formulations. Earlier methods expanded the fields and surface profiles in
power series under the assumption of small fluctuations in surface heights.^^*^^ Later
formulations considered perturbations in the dielectric functions due to surface roughness
and solved integral equations in the manner of the Born approximation of scattering
theory. ^ ^ Microscopic quantum mechanical approaches have also been implemented, and
the resulting expressions for scattering cross sections have been shown consistent with the
classical perturbation theories.^’^

The above approaches all suffer from the assumption of roughness profiles that are
small with respect to the wavelength of the incident light. These perturbative techniques are
usually only carried to first order and consequently do not describe many of the more
interesting features found in large amplitude studies (band gaps, frequency shifts, etc.).
Alternative approaches have also been developed which attempt to deal with large amplitude
gratings, and these constitute a whole subject unto themselves. Most notable are
formulations of the Rayleigh hypothesis, which assumes that the forms of the fields valid on
either side of the surface, or selvedge, region are also valid inside the selvedge region and
may be used to satisfy the appropriate boundary conditions at the surface. Another technique
is an integral equation method based on the "extinction theorem" which allows a formally
exact solution for the scattered fields in all regions without invoking the Rayleigh
assumption.^^’^^ Each method has its disadvantages, the Rayleigh hypothesis being limited
to describing fields in regions outside the surface region and the extinction theorem being
difficult to implement numerically.^^ Consequently, new techniques are constantly evolving.

Mindful  of  the  above  remarks,  the  aim  of  this  study  is  to  provide  a  qualitative 
understanding  of  the  effects  of  surface  roughness  on  light  scattering  from  semi-infinite 
uniaxial antiferromagnets. Most of the previous studies have considered the effects of
roughness on only plasmon polaritons and ferromagnetic magnons.^^ The most interesting
case  is  the  scattering  of  light  near  antiferromagnetic  surface  polariton  frequencies,  where  the 
scattered  light's  amplitude  and  linewidth  are  related  to  the  total  magnitude  of  the  power 
scattered  into  surface  polariton  states. 

The roughness is assumed to be slight and the perturbation method used by Mills
and Maradudin in their initial studies of scattering from rough metals is applied.^^^^ This
method, though not without certain shortcomings, provides a reasonable first attack on the
problem and can be shown to be consistent with other techniques, at least to first order in the
roughness height. Since this technique requires the Green's functions for a semi-infinite
antiferromagnet, the results of the last chapter are used.

Once  expressions  for  the  roughness  induced  scattered  fields  are  obtained,  the  power 
losses  due  to  roughness  from  a  beam  specularly  reflected  by  the  surface  of  an 
antiferromagnet  are  estimated.  Since  random  roughness  can  be  considered  as  a  superposition 
of  periodic  gratings  of  varying  heights  and  periods,  the  first  case  considered  is  for  a 
sinusoidal  one  dimensional  grating  impressed  upon  the  material’s  surface.  The  randomly 
rough  surface  is  represented  as  an  average  of  surface  profiles,  as  also  done  by  Mills  and 
Maradudin,  with  a  Gaussian  distribution  of  surface  heights  and  profiles. 

Besides allowing an incident beam to couple with either or both the +k and the −k
surface polariton branches, surface roughness can enhance the nonreciprocal reflectivity of
an antiferromagnet in an applied field. In other words, in an applied field the difference in
reflectivity between a beam incident at +θ and a beam incident at −θ, both of which couple to
the surface polariton modes, is increased by the presence of random roughness. There is
also the interesting result that roughness allows coupling between an incident beam and the
Brewster-like modes of a damped antiferromagnet. The properties of these damped modes
are discussed in detail in chapter 4.

The  outline  of  the  present  chapter  is  as  follows:  in  section  6.2  the  integral  equation 
for  the  scattered  fields  is  derived  and  solved  for  the  power  flow  inside  and  outside  the 
material. In section 6.3 a small amplitude periodic grating is used to study coupling to
surface  polaritons  on  MnF2.  Finally,  in  section  6.4  the  case  of  random  surface  roughness  is 
examined. 

6.1 Born approximation for scattered fields.

The geometry is the same as before with the exception that the surface is no longer
smooth. The actual surface height is some function ζ(x,z) which measures the deviation of
the actual surface above an ideal smooth surface at y = 0. This rough surface geometry is
shown in figure 6.1. The explicit forms of ε and μ are given by equations (2.3) and (2.8).

The electric and magnetic susceptibilities are uniform everywhere except near the
surface of the material. The only x and z dependence of the susceptibilities enters through
the profile function ζ(x,z). At long wavelengths, a step function behavior of the
susceptibilities at the surface is appropriate, and the dependence of the susceptibilities on the
roughness profile ζ(x,z) is represented by the step function Θ(ζ(x,z) − y):


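$$\epsilon_{ij}(\mathbf{x}) = \epsilon_{ij}^{o} + (\delta_{ij}-\epsilon_{ij}^{o})\,\Theta(\zeta(x,z)-y) \tag{6.1}$$

$$\mu_{ij}(\mathbf{x}) = \mu_{ij}^{o} + (\delta_{ij}-\mu_{ij}^{o})\,\Theta(\zeta(x,z)-y) \tag{6.2}$$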




Here $\delta_{ij}$ is the Kronecker delta, which makes the dielectric and magnetic susceptibility
tensors equal to unit matrices for the vacuum region. Note that we've written the
susceptibilities as tensor components. The superscripts 'o' denote the values appropriate for
the bulk material, and the $\epsilon_{ij}$ and $\mu_{ij}$ without superscripts are the position dependent
functions for the entire structure with the rough surface.

The roughness is treated as a small perturbation in the bulk material. If $\epsilon^{f}$ and $\mu^{f}$ are
the position dependent flat surface susceptibility tensors, then the position dependent rough
surface susceptibility tensors ε and μ are expanded to first order in the perturbation as:

$$\epsilon = \epsilon^{f} + \delta\epsilon \tag{6.3}$$

$$\mu = \mu^{f} + \delta\mu \tag{6.4}$$

Expressions of this form can be found by expanding the susceptibilities in (2.1) and
(2.2) in a Taylor series about ζ(x,z) = 0. The first order terms appearing in this expansion
are

$$\delta\epsilon_{ij} = \zeta(x,z)\,(\delta_{ij}-\epsilon_{ij}^{o})\,\delta(-y) \tag{6.5}$$

$$\delta\mu_{ij} = \zeta(x,z)\,(\delta_{ij}-\mu_{ij}^{o})\,\delta(-y) \tag{6.6}$$
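
The first order terms rest on the replacement $\Theta(\zeta - y) \approx \Theta(-y) + \zeta\,\delta(-y)$ for small ζ. A minimal numerical sketch of this step is given below; it checks that, for a smooth test function f, the integral of $f(y)[\Theta(\zeta - y) - \Theta(-y)]$ over y is close to $\zeta f(0)$. The function names, the test function, and the grid parameters are hypothetical choices made for illustration.

```python
import math

def step(u):
    """Heaviside step function, taken as 0 at u = 0."""
    return 1.0 if u > 0 else 0.0

def profile_difference_integral(f, zeta, half_width=1.0, n=200001):
    """Riemann sum of f(y) * [step(zeta - y) - step(-y)] over
    [-half_width, half_width]. For small zeta this should approach
    zeta * f(0), which is the weight carried by zeta * delta(-y)."""
    dy = 2.0 * half_width / (n - 1)
    total = 0.0
    for i in range(n):
        y = -half_width + i * dy
        total += f(y) * (step(zeta - y) - step(-y)) * dy
    return total

# Small positive and negative surface heights with a smooth test function.
bump = profile_difference_integral(math.cos, 0.01)
dent = profile_difference_integral(math.cos, -0.01)
```

Both signs of ζ are covered: a bump (ζ > 0) and a dent (ζ < 0) give integrals of opposite sign, in line with the factor ζ(x,z) multiplying the delta function in the first order terms.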

The electromagnetic fields are also written in a perturbation expansion. The
unperturbed fields are $\mathbf{H}^{o}$ and $\mathbf{E}^{o}$ and the first order corrections are $\mathbf{H}^{sc}$ and $\mathbf{E}^{sc}$. A time
dependence for H and E of the form exp(−iωt) is assumed. From Maxwell's equations a
wave equation for the scattered fields is derived and is valid to first order in the
perturbations:


vx  V  X  if®  -  ©2^*® = -i(o  vxt^ 


For the rest of the paper, $\omega_0 = \omega/c$. An equivalent expression for (6.7) is


j  m  \  ^  *■  ^  •  i 


The prime on the sum means that $m \neq j,k,l$. Since the dielectric tensor is diagonal, $\epsilon_{kk} = \epsilon_k$.
Note that for fields in the vacuum region the susceptibilities are given by $\mu_{ij} = \delta_{ij}$ and $\epsilon_{ij} = \delta_{ij}$.
The  vector  F  represents  the  driving  terms  and  is  given  by 


(6.9) 


Here $\epsilon_{ijk}$ is the Levi-Civita tensor.

The  perturbed  wave  equation  (6.8)  is  solved  by  a  Green's  function  method.  The 
scattered  field  is  determined  by 


(6.10) 

where $g_{ij}$ is the ij-th component of the semi-infinite antiferromagnetic Green's function tensor.
The  volume  of  integration  is  over  all  of  space.  Note  that  F  is  a  function  of  the  unperturbed 
H°  and  E®  fields  appropriate  to  the  material  region. 




For  reasons  discussed  in  chapter  5,  the  simpler  case  of  propagation  in  the  xy  plane 
with  the  incident  beam  polarized  perpendicular  to  the  plane  of  incidence  is  considered.  In 
this  case  none  of  the  quantities  in  (6.8)  depend  on  z  and  a  Fourier  transform  in  the  x  variable 
is used. The one dimensional Fourier expansion for $g_{ij}$ is

$$g_{ij}(\mathbf{x},\mathbf{x}') = \int \frac{dk_x}{2\pi}\, e^{ik_x(x-x')}\, g_{ij}(k_x; y, y') \tag{6.11}$$

The  relevant  quantities  for  the  transform  of  F  are  and  IJ.  Also,  only  surface  irregularities 
that  depend  only  on  the  x  coordinate  and  are  independent  of  z  can  be  described  in  this 
framework.  This  restriction  allows  a  reasonable  description  of  gratings  but  is  incapable  of 
fully  describing  a  randomly  rough  surface.  To  see  this,  suppose  the  roughness  varied  in  the 
z  direction.  One  would  then  expect  scattering  into  waves  which  do  not  propagate 
perpendicularly  to  the  applied  field  and  in  addition  there  could  be  some  scattering  of  the 
perpendicular  polarized  incident  wave  into  parallel  polarized  states.  In  this  simplified  theory, 
only  scattering  within  the  same  polarization  state  is  accounted  for.  This  theory  can  thus 
describe a grating with a variable period (one dimensional random roughness), but not a true
two dimensional random surface.

The x dependent surface profile function has the one dimensional expansion


oe 


-ikjX 

e 


(6.12) 
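
For a purely sinusoidal grating the profile expansion contains only two Fourier components. The sketch below illustrates the consequence under stated assumptions: a profile of the form $\zeta_0\cos(Qx)$, an incident in-plane wavevector $(\omega/c)\sin\theta$, and first order scattering only, so that the scattered in-plane wavevectors are the incident one shifted by ±Q. The function name and parameters are hypothetical, and the evanescence test applies to the vacuum side only.

```python
import math

def first_order_scattered_wavevectors(omega_over_c, theta, grating_q):
    """Return [(k_x, is_evanescent_in_vacuum), ...] for the two first order
    scattered components k0 + Q and k0 - Q, where k0 is the assumed
    in-plane wavevector (omega/c)*sin(theta) of the incident beam.

    A component is evanescent in vacuum when |k_x| exceeds omega/c, which
    is the regime in which coupling to surface modes can occur."""
    k0 = omega_over_c * math.sin(theta)
    components = []
    for sign in (+1, -1):
        kx = k0 + sign * grating_q
        components.append((kx, abs(kx) > omega_over_c))
    return components

# Normal incidence on a grating whose wavevector exceeds omega/c:
# both first order components lie outside the vacuum light line.
normal_incidence = first_order_scattered_wavevectors(1.0, 0.0, 1.5)
```

In this picture a single grating period couples the incident beam to both a +k and a −k component at once, and the incidence angle shifts the two components asymmetrically.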




Finally,  the  unperturbed  fields  have  only  one  Fourier  component: 


(6.13) 


£;(x)=«  E;(k^^,y) 


(6.14) 

$k_x^{(0)}$ is the wavevector of the unperturbed wave. In this problem, the unperturbed field
originates from a travelling electromagnetic wave incident on the rough surface of the
antiferromagnet. The amplitude of the unperturbed fields in (6.13) and (6.14) then depends
on $k_x^{(0)}$ and is determined in each region of space by the appropriate Fresnel relations. The
pertinent formulas are presented in the Appendix.

Using  the  transforms  (6.11-6.14),  the  scattered  field  of  (6.10)  can  be  written  in  the 
compact  notation 


(6.15) 




The  components  of  the  vector  A  are  defined  by 


oe 


(6.16) 


and 


00 


,(*,:y.y')a[-y') 


oe 

+ ^1-e^  )K}s„(i‘,;.y1  S(.y') 


(6.17) 

In deriving these formulas an integration by parts has been done, shifting the derivatives off
of δεE and onto the Green's functions.

As noted by Mills and Maradudin^ and others^^, the evaluation of the integrals in
(6.16) and (6.17) is not obvious since the discontinuity of the integrand allows for a number
of possible solutions. Following Mills, a prescription consistent with previous perturbative
treatments is taken. This prescription makes the source of the scattered fields proportional to
the amplitude of the unperturbed fields inside the material, as they are in equation (6.9), but
located in vacuum just outside the y = 0 plane. Hence the Green's functions appropriate for
sources in the y < 0 space are used with driving fields appropriate to the y > 0 space.

These choices result in the following expressions for A:


and 


(6.18) 


+ 


0)  , 

)]  +<■£/  -2(7-62  )k}gyy(k^;y..  ) 


(6.19) 

The  fields  and  Green's  functions  are  evaluated  at  the  flat  surface,  the  ±  signs  indicating  to 
which  side  of  y'  =0  the  expressions  belong.  The  derivatives,  of  course,  are  evaluated  before 
the  limiting  process. 

The  energy  flow  in  the  scattered  fields  can  now  be  calculated.  The  time  averaged 
Poynting  vector  is 


$\mathbf{S} = \frac{c}{8\pi}\,\mathbf{E}^{sc} \times \mathbf{H}^{sc*}$  (6.20)




One further notational device is required. Since the Green's functions and
wavevectors for y > 0 are different from those for y < 0, the expressions E, H and S will also
differ depending on whether they are taken in the material or outside of it. For the rest of the
paper then, the subscript ">" will indicate that an expression is valid inside the material and
"<" will indicate that an expression is valid outside the material (in vacuum). To reduce the
number  of  equations  which  follow,  it  is  useful  to  define  e>=€o  and  e<=l. 

Away  from  the  rough  surface,  the  perturbed  fields  satisfy 

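$\omega_0\,\epsilon_{\gtrless}\,\mathbf{E}^{sc}_{\gtrless} = -\mathbf{k}_{\gtrless} \times \mathbf{H}^{sc}_{\gtrless}$  (6.21)

where the $\gtrless$ indicates that one must choose either the expression appropriate to the y < 0
or the y > 0 region. The perturbed electric fields are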


ikjt 


The  wavevector  inside  the  material  is  given  by 


(6.22) 


(6.23) 


where  the  normal  component,  ky,  is  found  through  the  dispersion  relation 


(6.24) 

This relation is found by setting the determinant of the homogeneous equations of motion
to zero as discussed in previous chapters. The vacuum wavevector outside the material,
$\mathbf{k}_<$, is defined by


(6.25) 


with  the  normal  component  given  by  the  free  space  dispersion 

$k_{y<} = -\sqrt{\omega_0^2 - k_x^2}$  (6.26)

Note that in both (6.24) and (6.26) the signs on the radicals have been chosen so as to
represent waves decaying exponentially away from the surface when the arguments of the
radicals are real and negative. Likewise, the signs indicate waves travelling away from the
surface when the arguments of the radicals are positive and real.
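
The sign convention can be illustrated with a short numerical sketch. It assumes the free space form $k_{y<} = -\sqrt{\omega_0^2 - k_x^2}$ for (6.26), with $\omega_0 = \omega/c$, unitless quantities, and a spatial dependence $\exp(ik_y y)$ in the vacuum region y < 0. The function names are illustrative and do not come from the text.

```python
import cmath


def ky_vacuum(omega0: float, kx: float) -> complex:
    """Normal wavevector component in vacuum, k_y< = -sqrt(omega0^2 - kx^2).

    cmath.sqrt returns +i*sqrt(|arg|) for a negative real argument, so the
    leading minus sign yields a negative-imaginary k_y< in the evanescent
    case. With exp(i*k_y*y) this decays as y -> -infinity (assumed form).
    """
    return -cmath.sqrt(complex(omega0**2 - kx**2, 0.0))


def is_evanescent(omega0: float, kx: float) -> bool:
    """True when the argument of the radical is negative (|kx| > |omega0|)."""
    return abs(kx) > abs(omega0)


if __name__ == "__main__":
    # Sample values only: omega0 = 1, parallel wavevectors 0.2 and 1.2.
    for kx in (0.2, 1.2):
        print(kx, ky_vacuum(1.0, kx), is_evanescent(1.0, kx))
```

For a positive argument the result is real and negative, which under the assumed spatial dependence describes a wave travelling toward negative y, away from the surface.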

Taking the product $\mathbf{E}^{sc} \times \mathbf{H}^{sc*}$, the following quantities are defined

(6.27)

The vector functions $\mathbf{P}(k_\parallel, y)$ give the scattered power flow per unit area with wavevector
component parallel to the surface in the range $k_\parallel$ to $k_\parallel + dk_\parallel$, at a distance y away from the
surface of the material.




The  scattered  power  flows  are  then  given  simply  by 


<  J  < 

(6.28) 

Since the scattered fields and power flows depend on y through a complex
exponential term originating in the Green's functions, for further calculations it is useful to
illustrate this dependence explicitly by writing


‘'^>y 


(6.29) 

A y independent portion of P can also be defined, but the specific form depends on
the profile function $\zeta(k_\parallel)$. Two different forms for $\zeta(k_\parallel)$ are chosen here: one for a periodic
grating (see equation (6.43)), and one for a random grating (see equation (6.51)). When
averaged over the profile distribution for the random grating, the integral over $k_\parallel'$ in equation
(6.27) is equal to the product $p_\gtrless(k_\parallel)\exp(-2y\,\mathrm{Im}(k_{y\gtrless}))$. $p_\gtrless(k_\parallel)$ is given by


wxx  (k,))xx  (k.) 

<  256jt  <  <  <  < 

0 

(6.30) 

Here h is the average height of the roughness, and $\sigma$ is the correlation length of the random
profile distribution.

Finally, note that it is only the real part of (6.28) that is meaningful. Consequently,
in the calculations which follow, it is to be understood that only the real parts of (6.28) and
(6.30) are used where appropriate.




For  a  grating  of  period  s,  the  integral  in  (6.27)  has  a  more  complicated  form: 


where q takes the values $k_{x0}+s$, $k_{x0}-s$, and $k_{x0}$, with $k_{x0}$ the x component of the incident
wavevector. When $q = k_{x0}+s$ or $k_{x0}-s$, $p_\gtrless(k_\parallel, q)$ is given by

h2 

"5  ,  (^'k>(q)x^>(q))x^*(q) 

647C  or  <  <  <  < 

o 

Here h is the grating height. The calculation of (6.31) and (6.32) involves averages over x
under the assumption that the width of the illuminated surface area is much larger than 1/s.
This is discussed in section 6.3. Finally, when $q = k_{x0}$, $p_\gtrless(k_\parallel, q)$ contains the factor 256
instead of the 64.

Next the question of how to represent the power lost from the specularly reflected
beam is considered. First the Poynting vectors (6.28) are normalized to the illuminated
surface area by dividing by

$P_0 = \frac{c}{8\pi}\,|H^0|^2 L_x L_z \cos\theta_0$  (6.31)

$H^0$ is the amplitude of the unperturbed incident field, $L_x$ is the width in x of the area
illuminated by the incident beam and $L_z$ is the width in z. $\theta_0$ is the angle of incidence defined
in figure 6.1.




The real parts of the normalized expressions are integrated over an appropriate surface
to obtain the net scattered energy flow per unit surface area. The vacuum energy flows are
derived first. The calculation begins by separating the integral into integrals over two
different regions. In the region $|k_x| < |k_<|$ the integral describes the flow of energy in radiative
states.

Defining this flow as $I_<^r$, then


P(^„) 


(6.32) 

The surface of integration for the radiative integral is a cylinder of radius $L_x/2$ and length
$L_z$ whose axis coincides with the z axis of this geometry. In the vacuum region, $k_{y<}$ is purely
real for the radiated energy and the integral is straightforward:


(6.33) 

In the region $|k_x| > |k_<|$ the integral describes the flow of energy in evanescent states.
Calling this $I_<^e$, then


0 


(6.34) 


The energy flow in the evanescent fields is parallel to the surface in the ±x direction, so the
surface integral is over a strip of width $L_z$ that extends from y = 0 to y = ∞. The surface
element da is always taken in the +x direction. Since $k_{y<}$ is purely imaginary for all $k_x$ in this
wavevector region, the surface integral is easily performed with the result


A  . 


(6.35) 

In the material, $k_{y>}$ may be complex if damping exists. For the half cylinder region
within the material, the imaginary part of $k_{y>}$ in the radiative integral is due solely to damping
and represents an exponential decay of the wave into the material. Under the assumption that
the width of the illuminating beam, $L_x$, is much smaller than the decay length, the surface
integral in the material is done exactly as in the vacuum case:


(6.36) 

The evanescent flows in the material are found in the same manner as in the vacuum case
(except with the limits of integration over y from 0 to −∞) with the result


A  — >  . 

21Im(/:yJI 


(6.37) 

The ultimate goal is to calculate the change in reflectance due to the surface
roughness. First the sums $I_> = |I_>^e| + |I_>^r|$ and $I_< = |I_<^e| + |I_<^r|$ are defined. It is important to
note that while $I_>$ and $I_<$ represent scattering into states inside and outside the material
respectively, they do not by themselves determine the fraction of power scattered out of the
specularly reflected beam. The most one can say is that if $\Delta R$ and $\Delta T$ represent the changes
in reflectance and transmittance due to roughness induced scattering, then

$(I_> + I_<)_{ns} = \Delta R + \Delta T$  (6.38)

Only the nonspecular portions of the scattering ratios in (6.38) are needed to calculate the
losses from the specular beams. This is emphasized by the subscript "ns".

To approximate $\Delta R$, first write

$\Delta R = (I_> + I_<)_{ns} / (1 + \Delta T/\Delta R)$  (6.39)

and assume that the perturbing roughness does not have an appreciable effect on the ratio of
transmitted flux to reflected flux. This approximation is written

$\Delta T/\Delta R \approx T/R$  (6.40)

where T and R are the unperturbed transmittance and reflectance. The change in reflectance
due to surface roughness is then

$\Delta R = (I_> + I_<)_{ns} / (1 + T/R)$  (6.41)

The  ratio  T  /R  is  calculated  in  Appendix  I. 
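
The approximation can be written as a few lines of code. This is a minimal sketch, assuming the relations $(I_> + I_<)_{ns} = \Delta R + \Delta T$ and $\Delta T/\Delta R \approx T/R$; the function name and the numerical inputs are hypothetical and serve only to illustrate the arithmetic.

```python
def reflectance_change(i_inside_ns: float, i_outside_ns: float, t_over_r: float) -> float:
    """Return Delta R = (I_> + I_<)_ns / (1 + T/R).

    i_inside_ns, i_outside_ns: nonspecular scattering ratios in the material
    and in vacuum. t_over_r: unperturbed ratio of transmittance to reflectance.
    """
    if t_over_r < 0:
        raise ValueError("T/R must be non-negative")
    return (i_inside_ns + i_outside_ns) / (1.0 + t_over_r)


if __name__ == "__main__":
    # Hypothetical inputs, not taken from the text.
    total_ns = 0.02 + 0.01
    d_r = reflectance_change(0.02, 0.01, 0.5)
    d_t = total_ns - d_r  # remainder is assigned to the transmittance change
    print(d_r, d_t, d_t / d_r)
```

By construction the remainder $\Delta T$ has the same ratio to $\Delta R$ as T has to R, which is exactly the assumption made above.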

6.2  One  dimensional  grating. 

The  power  scattered  by  a  periodic,  sinusoidal  grating  ingrained  upon  the  surface  of 
the  antiferromagnet  is  now  calculated.  Here  the  surface  profile  function  takes  the  form 

$\zeta(x) = h(\cos(sx) - 1)/2$  (6.42)




The grating depth is given by h, which is assumed small in comparison to the incident
wave's wavelength. The spatial period of the grating is 1/s. Note that the expression for the
grating is such that $\zeta(x) \le 0$, as required by the evaluation of the integrals in equations (6.16)
and (6.17).

Transforming,  as  per  equation  (6.12),  the  profile  function  becomes 

(6.43) 

When substituted into the expressions for S given by (6.28), the delta functions pick out the
Fourier coefficients of the scattered power flow corresponding to $k_{x0}+s$, $k_{x0}-s$, and $k_{x0}$
from the integrals over $k_x$ and $k_x'$. These terms correspond to the zero order reflected beam
plus the first order diffracted beams. When evaluating the expressions for p, however, one
encounters terms proportional to exp(2isx) and exp(isx) which represent interference terms
between the three beams. If a large portion of the grating is illuminated, the scattered beams
are averaged over a large range of x. In this spatial average, the interference terms vanish.
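
The three Fourier components can be checked numerically. The following sketch assumes the sinusoidal profile $\zeta(x) = h(\cos(sx) - 1)/2$ and computes discrete Fourier coefficients over one repeat length of cos(sx). The function name and sample values are illustrative, not taken from the text, and no overall normalization of the continuous transform is implied.

```python
import cmath
import math


def grating_fourier_coefficients(h, s, orders=(-2, -1, 0, 1, 2), n_samples=64):
    """Coefficients c_m in zeta(x) = sum_m c_m * exp(i*m*s*x).

    Uses a plain discrete sum over one repeat length 2*pi/s of cos(s*x).
    """
    repeat = 2.0 * math.pi / s
    coeffs = {}
    for m in orders:
        total = 0j
        for j in range(n_samples):
            x = repeat * j / n_samples
            zeta = h * (math.cos(s * x) - 1.0) / 2.0
            total += zeta * cmath.exp(-1j * m * s * x)
        coeffs[m] = total / n_samples
    return coeffs


if __name__ == "__main__":
    # Sample values only.
    for order, value in grating_fourier_coefficients(h=0.002, s=0.5).items():
        print(order, value)
```

Only the orders 0 and ±1 are nonzero (equal to -h/2 and h/4 respectively), which is why the scattered power contains just the zero order beam and the two first order diffracted beams at this order of perturbation theory.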

Substituting  the  profile  function  (6.43)  into  the  real  parts  of  equations  (6.28)  the 
following  expressions  for  the  power  flow  into  the  material  are  obtained: 

0 

+(  ^  )  X  5) 

+7(  t  I 


(6.44) 




Here we have introduced the abbreviations $k_{y>}^{\pm} = k_{y>}(k_{x0} \pm s)$ and $k_{y>}^{0} = k_{y>}(k_{x0})$. The
functional form for $k_{y>}$ is given explicitly by equation (6.24).

The brackets around S indicate that the expression is averaged over x with the
assumption that $L_x$ is much larger than 1/s. Outside the material, the Poynting vector is:


<S> 


-ch‘ 


64n^ 


(0‘ 


+  (  s)  e 

+  4  (  ^  ^  V^x<»)  )  X  X{k^^)  e  | 

(6.45) 

The $k_x$ dependence of $k_{y<}$ is given by equation (6.26) and is abbreviated in the same manner
as in (6.44).

Several  comments  are  in  order.  First,  these  expressions  represent  power  flow  due 
to  sources  originating  in  the  surface  grating.  These  sources  are  driven  by  the  incident  fields. 
When the surface roughness is "turned off", the scattered fields must vanish. This obviously
holds  for  the  scattered  power  flows  of  (6.44)  and  (6.45). 

When  the  surface  roughness  is  "turned  on",  some  of  the  incident  energy  is  scattered 
out  of  the  specularly  reflected  beam.  This  is  clearly  represented  by  the  first  two  terms  in 
(6.44)  and  (6.45),  which  describe  scattering  into  the  first  order  diffracted  beams.  Inclusion 
of  higher  order  perturbation  terms  in  the  wave  equation  (6.7)  for  the  scattered  fields  would 
result  in  the  appearance  of  higher  order  diffracted  beams  in  the  power  flow  expressions. 

The surface roughness does not necessarily scatter the incident energy in
non-specular directions, however. The third terms in equations (6.44) and (6.45) represent
scattering into the zeroth order diffracted beam. These describe power flow scattered in the
direction of the unperturbed specular beams.

For certain values of $k_{x0} \pm s$, $k_{y\gtrless}$ will be purely imaginary and will thus describe
fields exponentially decaying away from the surface. These fields correspond to evanescent
waves travelling along the surface perpendicular to the easy axis. Whether $k_{y\gtrless}$ is real or
imaginary depends on the magnitude of $k_{x0}$ (which depends on the angle of the incident wave
with respect to the surface) and the magnitude of the grating's period 1/s.

When the grating induces evanescent waves travelling along the surface, there exist
frequencies where surface polariton modes can be excited. In this way the grating allows
for coupling between the incident electromagnetic wave and surface antiferromagnetic
polaritons. To illustrate this, in figure 6.2 the dispersion curves for bulk and surface
polaritons are reproduced. In this example and the rest that follow, the material is MnF2, a
uniaxial antiferromagnet with the parameters $H_a = 7.87$ kGauss, $H_e = 550$ kGauss,
$M_s = .6$ kGauss and $\epsilon = 5.5$. The antiferromagnetic resonance frequency, $\Omega$, is given by
$\Omega = \gamma\sqrt{2H_eH_a + H_a^2}$ where $\gamma$ is the gyromagnetic ratio. Unless otherwise stated,
damping is always present with the value $1/\Omega\tau = .0001$. The unitless frequency and
wavevector are given by $\omega/\Omega$ and $kc/\Omega$, respectively. Also, the approximation discussed in
chapter 5 is used to allow the Green's functions to represent leaky wave excitations.
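
As a rough numerical check of these parameters, the sketch below evaluates the resonance frequency from the standard zero field expression for a uniaxial antiferromagnet, $\Omega = \gamma\sqrt{H_a(2H_e + H_a)}$, taking 7.87 kGauss as the anisotropy field and 550 kGauss as the exchange field. It also converts unitless lengths to millimetres. The value $\gamma/2\pi \approx 2.8$ GHz/kGauss (g ≈ 2) is an assumption not stated in the text, as is the choice to divide c by 268 GHz directly, which is inferred from the lengths quoted in this chapter. All names are illustrative.

```python
import math

GAMMA_GHZ_PER_KG = 2.8      # assumed gamma/2pi for g ~ 2; not given in the text
C_MM_GHZ = 299.792458       # speed of light expressed in mm * GHz


def afmr_frequency_ghz(h_a_kg: float, h_e_kg: float,
                       gamma: float = GAMMA_GHZ_PER_KG) -> float:
    """Zero field resonance frequency, gamma * sqrt(H_a * (2*H_e + H_a))."""
    return gamma * math.sqrt(h_a_kg * (2.0 * h_e_kg + h_a_kg))


def unitless_to_mm(value: float, freq_ghz: float = 268.0) -> float:
    """Convert a length quoted in units of c/Omega to mm (assumes c / freq)."""
    return value * C_MM_GHZ / freq_ghz


if __name__ == "__main__":
    print(afmr_frequency_ghz(7.87, 550.0))   # estimate in GHz
    for length in (1.0, 0.5, 0.1, 0.0002):
        print(length, unitless_to_mm(length))
```

With these assumptions the estimate comes out near 261 GHz, within a few percent of the 268 GHz used in this chapter, and a unitless length of 1 corresponds to roughly 1.1 mm, in line with the approximate lengths quoted for MnF2.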

These curves in figure 6.2 are for zero applied field. The shaded areas represent
bulk polariton modes and the dotted lines rising out of the lower bulk band are surface
polariton modes. The solid lines inside the bulk bands and above the surface modes are the
leaky modes described in Chapter 4. The straight, nearly vertical line is the light line where
$\omega_0 = k$. The two straight dashed lines correspond to the grating induced wavenumbers $k_{x0} \pm s$
for the case $k_{x0}c/\Omega = .7$ and $s = .5c/\Omega$. (In MnF2, with an antiferromagnetic resonance
frequency $\Omega = 268$ GHz, this s is approximately .5 mm.) For these values, $k_{y<}$ is purely
imaginary for the +s case and the corresponding scattered fields decay exponentially away
from the surface.




Figure 6.2. Dispersion curve for antiferromagnetic polaritons, in MnF2, with no applied
field. The shaded areas are bulk bands and the dotted lines are surface modes.
The solid lines are leaky modes. The straight solid lines are light lines,
where $\omega_0 = k$. The straight dashed lines are the grating induced lines, $q = k_{x0} \pm s$,
for $k_{x0}c/\Omega = .7$ and $s = .5c/\Omega$.




The light line does not intersect any of the surface polariton modes, but for this
choice of s, the grating induced line of $k_{x0} + s$ intersects a surface mode. The presence of
the grating thus allows an incident electromagnetic wave to couple with polariton modes
inaccessible when the surface is ideally smooth. Note that higher order terms in the power
flow expressions would result in more grating induced lines being drawn, and hence
couplings to portions of the surface polariton modes at even shorter wavelengths.

The sum of the +s and the -s terms for the parallel power flows is plotted in figure
6.3 for $H_0 = 0$, $s = .5c/\Omega$ and $\theta_0 = 45°$. Plotted are $\langle S_{x<}\rangle/ch^2$ just outside the material in
vacuum at y = 0− and $\langle S_{x>}\rangle/ch^2$ just inside the material at y = 0+. The large peak at
$\omega = 1.001\Omega$ is approximately where the grating line intersects the surface mode shown in the
dispersion curves of figure 6.2. Note that the energy flow outside the material is larger and
oppositely directed to the energy flow inside the material. Inside the material, the
susceptibilities are negative for the surface modes. In consequence, the Poynting vector is
also negative and oppositely directed to the vacuum Poynting vector. This observation has
also been made for surface plasmon-polaritons and surface magnetoelastic polaritons.

The smaller peak in figure 6.3 is from a surface resonance that lies within the bulk
band. This peak corresponds to a location on the dispersion curve of figure 6.2 at
$\omega = .998\Omega$ on the -s grating line. Note that this places the resonance on the left side of the
light line, in a region where true surface polaritons cannot exist. These resonances are due to
the Brewster-like leaky modes described in Chapter 4. A similar strong excitation of a leaky
mode due to the coupling provided by a grating has been discussed for the case of elastic
waves by Glass and Maradudin.

Figure 6.3. Parallel power flows inside and outside the material as functions of frequency
$\omega/\Omega$ on a periodic grating with $s = .5c/\Omega$. The quantities shown are proportional to
$\langle S_{x>}\rangle$ and $\langle S_{x<}\rangle$ (arbitrary units). There is no applied field, $\theta_0 = 45°$, and damping is
$1/\Omega\tau = .0001$. The larger, higher frequency peak is due to a surface polariton.
The other peak results from a surface resonance that lies within the bulk
polariton band.




The leaky waves depend on material damping for their existence and so it is
interesting to study them for larger damping parameters. In figure 6.4 the parallel power
flows are plotted for the same values of s and $\theta_0$ as in figure 6.3 but with a larger damping
constant of .0008. The quantity actually plotted is $\langle S_x\rangle/ch^2$. The larger peak is still the
surface polariton but there is a distinct broadening of both the resonance and the polariton.
In both figures 6.3 and 6.4 also note the power flows inside the material are opposite those
outside the material for both surface excitations. While the surface polariton carries most of
its energy outside the material, the resonance carries most of its energy inside the material.

Having  examined  the  possible  energy  flows  inside  and  outside  the  material,  the 
fraction  of  energy  scattered  out  of  an  incident  wave  into  these  radiative  and  evanescent  states 
can  be  calculated.  For  the  periodic  grating  the  power  ratio  equations  (6.33, 6.35, 6.36,  and 
6.37)  become 


ah^cos(,6^ 


‘^rc  of  cos(flp 


(<7)  - 


(6.46) 


32L^mk^,{q)\H“’nofe^coi(e^ 


(6.47) 


- h  a  4-, -  (^y< 


(<7)  -  i(l))^yS‘i) 


(6.48) 


Figure 6.4. Parallel power flows inside and outside the material as functions of frequency
$\omega/\Omega$ on a periodic grating with $s = .5c/\Omega$. The quantities shown are proportional to
$\langle S_{x>}\rangle$ and $\langle S_{x<}\rangle$. There is no applied field, $\theta_0 = 45°$, and damping is
now $1/\Omega\tau = .0008$. The peaks are much broader now than in the .0001
damping case of figure 6.3.




Here q is understood as either $k_{x0}+s$ or $k_{x0}-s$. As discussed earlier, the magnitude of q
determines whether the power flow is radiative or evanescent. Thus $I_>^r$ represents radiative
scattering in the material when $|q| < |k_>|$ and $I_>^e$ represents evanescent scattering when $|q| > |k_>|$.
Similarly, $I_<^r$ and $I_<^e$ represent radiative and evanescent scattering in vacuum when $|q| < |k_<|$
and $|q| > |k_<|$.

Before proceeding, recall that in an applied field the surface modes become highly
nonreciprocal with respect to propagation direction (i.e., $\omega(+k_x) \ne \omega(-k_x)$). For future
reference, the dispersion curves for the bulk and surface polaritons in an applied field of
.3 kG are presented in figure 6.5 for $1/\Omega\tau = .0001$. The shaded areas are again bulk polariton
bands and the dashed lines are surface polariton modes. The solid lines are surface leaky
modes.

Now it is possible, for certain choices of grating period s, to couple an incident
wave whose parallel wavevector component is in the +x direction with surface modes
whose wavevectors are in the -x direction. For example, suppose a wave of frequency $\omega$
incident on a grating with period 1/s at some angle $\theta_0$ couples to a surface mode travelling
in the -x direction with wavevector $k_{x0} - s$. It is also possible for an incident wave at another
frequency $\omega'$ to couple with a surface mode travelling in the +x direction with wavevector
$k_{x0}' + s$ for the same angle of incidence. Thus for fixed s and $\theta_0$, one can couple to both the
$+k_x$ and the $-k_x$ polariton branches by scanning the frequency of the incident wave. Placing
the material in an applied field will increase the difference in frequencies by exploiting the
nonreciprocity of the surface modes. An example is given in figure 6.6 where the evanescent
power flows are plotted as functions of frequency for $s = 3c/\Omega$ and $\theta_0 = -45°$. An external
field of .3 kG is applied. The quantities actually plotted are proportional to $I_>^e$ and $I_<^e$.


Figure 6.5. Dispersion curve (frequency axis $\omega/\Omega$) for antiferromagnetic polaritons, in MnF2,
with an applied field of .3 kG. The shaded areas are bulk bands and the dotted lines are surface
modes. The solid lines are leaky surface modes. The straight solid lines are the
light lines, where $\omega_0 = k$. The straight dashed lines are the grating induced
lines, $q = k_{x0} \pm s$, for $k_{x0}c/\Omega = -.7$ and $s = 3c/\Omega$. Note the strong nonreciprocity of
the surface modes with respect to propagation direction.




Figure 6.6. Evanescent power flows inside and outside the material as functions of
frequency $\omega/\Omega$ on a periodic grating with $s = 3c/\Omega$. The quantities shown are
proportional to $I_>^e$ and $I_<^e$. The applied field is .3 kG, $\theta_0 = -45°$, and damping is
$1/\Omega\tau = .0001$. For this s, we now pick up both the $+k_x$ surface mode and the
$-k_x$ surface mode. The peaks labelled "μ" occur at frequencies where the
susceptibilities become very large.




To identify the peaks in this figure, refer to the dispersion curves of figure 6.5 where
there are lines representing the grating induced wavevectors for $s = 3c/\Omega$ and $k_{x0}c/\Omega = -.7$.
The peak nearest $\omega/\Omega = 1$ in figure 6.6 corresponds to where the -s grating line crosses the $-k_x$
surface polariton mode in figure 6.5. The peak at $\omega = 1.005\Omega$ in figure 6.6 corresponds to
where the +s grating line crosses the $+k_x$ surface polariton mode.

The power flow for both modes is largest in vacuum and in the directions expected
for the different modes: $I_<^e$ is positive for the $+k_x$ mode and negative for the $-k_x$ mode.
Also, note that the power flow in the material is no longer in a direction opposite that of the
vacuum flow. This does not contradict the zero field case because although $\mu_1$ is still
negative, the applied field also influences the direction of the material power flow through the
off diagonal element of the susceptibility tensor, $\mu_2$.

The two peaks labelled "μ" are at the antiferromagnetic resonance frequencies. At
these frequencies $\mu_1$ and $\mu_2$ become very large and change sign. Since the driving fields of
the perturbation expansion are proportional to $\mu_1$ and $\mu_2$, the theory breaks down at these
frequencies. Thus the magnitude of the energy flow in the evanescent (and radiative) fields
at these frequencies may be exaggerated with respect to the magnitude of the surface
polariton peaks.

In figure 6.7 the reflectance of the smooth surface, R, and the reflectance of the
rough surface, $R - \Delta R$, are plotted versus $\theta_0$ for two frequencies. There is no applied field and
$s = 1c/\Omega$. The grating height is $h = .0002c/\Omega$ (for MnF2, this height is approximately .2 µm)
and the width of the illuminating beam, $L_x$, is set equal to $\Omega/c$ for convenience. The
presence of the grating introduces dips into the reflectance which are not seen for the smooth
surface. These dips are, as usual, due to the coupling to surface polaritons, in this case the
"true" surface polaritons, not the surface resonances. The dips are reciprocal in angle and are
much farther apart at the higher frequency than at the lower. This is due to the positive group
velocity of the modes, as seen in figure 6.2, where the higher frequency modes exist at larger
wavevectors.


Figure 6.7. Reflectivity as a function of $\theta_0$ (from -90 to 90 degrees) for a smooth surface
(dashed lines) and a surface with a grating of period $s = 1c/\Omega$ (solid lines). There is no
applied field and damping is $1/\Omega\tau = .0001$. The two frequencies couple with surface polaritons
with different group velocities. The higher frequency polariton has a smaller
group velocity and consequently has a greater density of states. Thus it
produces a broader and deeper reflectivity dip than the lower frequency mode.




An  interesting  point  is  the  increased  width  of  the  dips  at  the  higher  frequency.  This 
is  due  to  the  group  velocity,  which  decreases  with  increasing  frequency,  thus  creating  a 
higher  density  of  states  at  the  higher  frequency.  Since  damping  allows  the  incident  wave  to 
excite  surface  polaritons  over  a  small  range  of  frequency,  a  greater  density  of  states  allows 
the  incident  wave  to  excite  a  larger  number  of  polariton  states. 

6.3  Randomly  rough  surface. 

The case of random roughness is somewhat simpler algebraically than the one
dimensional grating. First, the average of S over the possible height profiles $\zeta(x)$ is
calculated. A Gaussian form for the correlation function is assumed:

$\langle \zeta(x)\,\zeta^*(x+x') \rangle = h^2 e^{-x'^2/\sigma^2}$  (6.50)

Here $h^2$ corresponds to the mean square height $\langle \zeta^2(x) \rangle$ and $\sigma$ is the correlation length for
the distribution of profiles. The Fourier transform is simply


<C(jk)C  \k)>  =  %  2  g 

4k 


(6.51) 


Using  the  profile  correlation  function  of  (6.51),  the  power  ratio  equations  (6.33, 6.35, 6.36, 
and  6.37)  are  given  by 


ll<f 


I 


dk 


(kX  (k) 

^  X  y>''  x' 


■k  X 

y>  x> 


»M> 


e 


(6.52) 




and 


f _ aV 

^  («//*  4  I*  ^ 


(*xV  (y 


■t  A 

)K  X< 


<k^-kJ(//4 

e 


for  the  radiative  fields,  and 


(6.53) 


aV 


-<V<'jV/4 


.) 


(6.54) 


^  aV 

""  /2SLy  ‘  coi(d^ 


H:,l>l  kj 


<k^-kJ<//4 


|Im(^y^  )l 


(6.55) 

for the evanescent fields.

The change in reflectivity is calculated by (6.41) as for the periodic grating. The
nonspecular portions of $I_>$ and $I_<$ are calculated by excluding from the integration a small
region about $k_{x0}$. This exclusion has a physical interpretation when the Fourier amplitudes at
each $k_x$ are related to a scattering angle, $\theta_s$, via the geometrical relation

$k_x = |k_<|\sin\theta_s$  (6.56)

In a reflection experiment, the excluded region would be the angular width of a detector.
This width is taken as one degree and the $k_x$ values in the range $|k_<|\sin(\theta_0 + 0.5°)$ to
$|k_<|\sin(\theta_0 - 0.5°)$ are excluded from the integrations in (6.52-6.55).
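
The exclusion window is simple to compute. The sketch below assumes the geometrical relation $k_x = |k_<|\sin\theta$ and a detector of full angular width one degree centred on the specular direction. The function names and the sample inputs are illustrative and do not come from the text.

```python
import math


def excluded_kx_range(k_vac: float, theta0_deg: float,
                      detector_width_deg: float = 1.0) -> tuple:
    """Return (low, high) bounds of the k_x window treated as specular."""
    half = detector_width_deg / 2.0
    a = k_vac * math.sin(math.radians(theta0_deg - half))
    b = k_vac * math.sin(math.radians(theta0_deg + half))
    return (min(a, b), max(a, b))


def is_specular(kx: float, k_vac: float, theta0_deg: float,
                detector_width_deg: float = 1.0) -> bool:
    """True when kx falls inside the excluded (detector) window."""
    low, high = excluded_kx_range(k_vac, theta0_deg, detector_width_deg)
    return low <= kx <= high


if __name__ == "__main__":
    # Sample values only: unit vacuum wavevector, 45 degree incidence.
    print(excluded_kx_range(1.0, 45.0))
    print(is_specular(0.7071, 1.0, 45.0), is_specular(1.2, 1.0, 45.0))
```

In a numerical evaluation of the nonspecular integrals, contributions with `is_specular(...)` true would simply be skipped.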




With random roughness, there is an infinite collection of grating periods which
produce an effective "width" in wavelength analogous to the damping width in frequency.
This width is measured by the correlation length $\sigma$. Smaller values of $\sigma$ lead to a greater
smearing in wavelength while larger values indicate a smoother surface and a narrower
wavelength range.

The dependence of the scattering on $\sigma$ is shown in figures 6.8 and 6.9. In both
figures $I_<^e$ is plotted as a function of frequency for $\theta_0 = -45°$. There is an applied field of
.3 kG. In figure 6.8, $\sigma = .1c/\Omega$ (for MnF2, this correlation length is approximately .1 mm). The
largest peak represents coupling with the $-k_x$ surface mode of figure 6.5. The smaller peak
at the higher frequency represents coupling with the $+k_x$ surface mode. In both cases the
largest coupling occurs where the density of states is the greatest. In figure 6.5 this occurs
where the surface branches flatten out and the group velocity approaches zero. This occurs
at rather large wavevectors and so requires a fairly rough surface to allow coupling with an
incident wave. The "μ" peaks discussed earlier are present, but dwarfed in comparison to
the surface mode peaks.

In figure 6.9, $\sigma = 1c/\Omega$ (a correlation length of about 1 mm for MnF2) and a distinctly
different profile for the scattered energy results. First, the magnitude of the surface mode
peaks is considerably reduced in comparison to the "μ" peaks which are now clearly visible
at $\omega/\Omega = .997$ and $\omega/\Omega = 1.003$. The surface mode peaks are also at lower frequencies
than the rougher surface case. The shift in frequency and the reduced amplitudes of these
surface mode peaks are both due to lack of coupling to the higher (and denser) frequency
portions of the surface mode dispersion curves. The strongest coupling for this smoother
surface now occurs nearer to the light line.
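
The role of the correlation length can be illustrated numerically. For a Gaussian correlation function, the roughness spectrum carries a factor $\exp[-(k_x - k_{x0})^2\sigma^2/4]$. The sketch below evaluates only this factor, with all prefactors of the power ratio equations omitted. The function name and the sample wavevectors are illustrative assumptions.

```python
import math


def roughness_weight(kx: float, kx0: float, sigma: float) -> float:
    """Relative Gaussian weight exp(-(kx - kx0)^2 * sigma^2 / 4).

    Unitless inputs (wavevectors in Omega/c, sigma in c/Omega); prefactors
    of the full scattering expressions are deliberately left out.
    """
    return math.exp(-((kx - kx0) ** 2) * sigma ** 2 / 4.0)


if __name__ == "__main__":
    # Sample case: incident kx0 = -0.7, scattered state at kx = -5.7.
    for sigma in (0.1, 1.0):
        print(sigma, roughness_weight(-5.7, -0.7, sigma))
```

For a wavevector transfer of 5 in unitless form, the weight stays above 0.9 for σ = .1 but drops below 0.01 for σ = 1, which is consistent with the smoother surface losing its coupling to the large wavevector, high density of states portions of the surface mode branches.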


Figure 6.8. Evanescent power flows outside the material as functions of frequency $\omega/\Omega$ on a
rough surface with $\sigma = .1c/\Omega$. The quantity shown is proportional to $I_<^e$. The
applied field is .3 kG, $\theta_0 = -45°$, and damping is $1/\Omega\tau = .0001$. For this $\sigma$, we
pick up both the $+k_x$ surface mode and the $-k_x$ surface mode near their limiting
frequencies where the density of states is largest.


Figure 6.9. Evanescent power flows outside the material as functions of frequency $\omega/\Omega$ on a
rough surface with $\sigma = .5c/\Omega$. The quantity shown is proportional to $I_<^e$. The
applied field is .3 kG, $\theta_0 = -45°$, and damping is $1/\Omega\tau = .0001$. For this $\sigma$, we
pick up contributions from the surface polaritons nearer the light line. These
polaritons have relatively large group velocities and small state densities, hence
they are not as strong scatterers as the shorter wavelength polaritons. Note the
prominence of the "μ" peaks compared to the surface polariton peaks.




In figure 6.10 the effect of $\sigma$ on a reflection measurement is explored. Here
the reflectance of the smooth surface, R, is shown together with the reflectance of the rough
surface, $R - \Delta R$, as functions of $\theta_0$ in an applied field of .3 kG. The frequency is $.9989\Omega$
and so the incident wave can couple with both the $+k_x$ and the $-k_x$ surface mode branches of
figure 6.5. The rms height is $h = .003c/\Omega$ (a height of 3 µm for MnF2) and the cases
$\sigma = .005c/\Omega$ and $\sigma = .01c/\Omega$ are presented. Note that with this small damping value, the
reflectance from the smooth surface is fairly reciprocal with respect to $\theta_0$. The rough
surface, however, induces nonreciprocity by coupling the incident wave more strongly with
the surface polariton modes.

The greatest losses occur for couplings with the -k_x branch. This is because the -k_x
branch is flatter than the +k_x branch, and thus has a greater density of states nearer the light line
than the +k_x branch. Not only has the change in reflectivity increased with the slightly
rougher surface, as expected from the dependence of ΔR on σ, but the +k_x and -k_x peaks are also
more pronounced.


Figure 6.10. Reflectivity as a function of θ0 for a smooth surface, a rough surface with
a=^\cin ,  and  a  rough  surface  with  o^.  Ic/XZ  There  is  an  applied  field  of  .3kG 
and damping is 1/Ωτ=.0001. The rms height is h=.003c/Ω. The frequency of
the incident wave is .9989Ω, which allows coupling to both the +k_x and the
-k_x surface polariton. With the applied field, these modes have different group
velocities. The largest dip is due to the -k_x surface polariton, which has the
greatest density of states nearest the light line. As roughness is increased, the
nonreciprocal reflection losses become more pronounced.


CHAPTER 7

CONCLUSIONS 


In this work, electromagnetic excitations on semi-infinite antiferromagnets have been
examined within the framework of a phenomenological theory based on the macroscopic
Maxwell equations. The problem was approached from two points of view: (1) a
description of the possible excitations of the electromagnetic field created by an
antiferromagnet through dispersion relations obtained by solving the wave equation, and (2)
a representation depicting the propagation of electromagnetic waves on the antiferromagnet
obtained by calculating the surface response functions appropriate to the semi-infinite
geometry.

Throughout  the  discussion,  the  focus  has  been  primarily  on  topics  of  interest  to 
possible  experimental  and  technological  application.  For  this  reason  the  coupling  between 
photons  and  polaritons  has  been  considered  in  detail  and  mathematical  models  have  been 
constructed that describe possible experimental situations. Four investigations were carried
out. The first concerned the possibility of nonreciprocal reflection from antiferromagnets
when absorption mechanisms were present. The second investigation revealed that when
damping is present in the antiferromagnet, new surface excitations may exist that are quite
distinct from antiferromagnetic surface polaritons. The third investigation showed that
incident  light  could  excite  surface  excitations  by  scattering  off  a  sinusoidal  grating  ruled  on 
the  surface  of  the  antiferromagnet.  The  final  investigation  showed  how  nonreciprocal 
reflectance,  originating  this  time  in  nonreciprocal  absorption  by  surface  polaritons,  could  be 
enhanced  by  scattering  incident  light  from  a  randomly  rough  surface.  The  results  of  each  of 
these  four  inquiries  are  now  briefly  summarized. 




7. 1  Bulk  polaritons  and  nonreciprocal  reflection. 

A simple thermodynamic argument shows that reflection from a smooth
antiferromagnet, in regions where the material is optically transparent, can be nonreciprocal
with respect to incident angle even when the material is non-absorptive. By considering all
the  possible  polarizations  of  the  incident  and  reflected  light,  the  total  reflected  energy 
summed  over  all  polarization  states  is  in  fact  reciprocal  with  respect  to  propagation  angle. 
When  the  polarization  states  are  considered  individually,  however,  a  given  polarization  state 
may  have  a  highly  nonreciprocal  reflectance. 

In a numerical example, the antiferromagnet MnF2 displayed nonreciprocal
reflection within circularly polarized states; i.e., the reflectance of a right circularly polarized
incident wave changed as much as 20% between the incident angles θ=45° and θ=-45°
when the antiferromagnet was in an applied field of .3kG. The nonreciprocity is strongly
dependent on the frequency of the incident wave and is greatest near the antiferromagnetic
resonance frequencies. In some frequency regions, the nonreciprocity will reverse with
incident angle and R(θ)-R(-θ) will change sign.

The  nonreciprocity  is  ultimately  due  to  a  propensity  of  the  incident  wave  to  couple 
most  strongly  with  bulk  polariton  modes  with  the  same  polarization.  The  bulk  modes  are 
right  and  left  circularly  polarized,  and  in  an  applied  field  bulk  waves  travelling  in  the  same 
direction  parallel  to  the  applied  field  with  different  polarizations  have  different  amplitudes. 
The  polarization  of  the  bulk  modes  is  strongly  influenced  by  applied  magnetic  fields  and  thus 
leads  to  nonreciprocal  transmittance,  and  consequently,  nonreciprocal  reflectance  of  the 
incident  wave. 

7.2 Leaky surface modes on antiferromagnets.

When damping is included in the susceptibilities governing the magnetic
properties of the antiferromagnet, an analysis of the resulting surface polariton dispersion
relation shows surface resonances in frequency regions forbidden to "true" antiferromagnetic
surface polaritons. These excitations are analogous to the leaky modes found for surface
electromagnetic waves on dielectrics and are characterized as dissipative waves that "leak"
energy from the surface wave into the interior of the material.

A close correspondence is found between these antiferromagnetic surface resonances
and the evanescent Brewster and evanescent modes found in plasmon polariton studies. The
antiferromagnetic modes have finite path lengths and can be highly dissipative with large
penetration depths into the material. The penetration depth is frequency dependent and
sensitive to the particular type of damping used in the description (i.e., Bloch damping
versus Landau).

The  surface  modes  with  damping  present  are  reciprocal  when  there  is  no  applied 
field. In an applied field, their reciprocity is frequency dependent: they are nearly
reciprocal at low frequencies within the lowest bulk band but become nonreciprocal at higher
frequencies where they exist in the "true" surface polariton regime. Also, the direction of
the leaky modes' energy flow inside the material is extremely sensitive to damping and applied
fields, especially near the antiferromagnetic resonance frequencies.

7.3  Scattering  from  periodic  gratings. 

The  electromagnetic  Green’s  functions  for  a  semi-infinite  antiferromagnet  were 
calculated  for  the  special  case  of  propagation  perpendicular  to  the  easy  axis.  A  study  of  the 
peaks of the Green's functions revealed the expected surface excitations and a correspondence
between  the  poles  of  the  Green's  functions  and  the  surface  modes  was  established.  The 
Green's  functions  were  next  applied  to  a  perturbative  treatment  of  scattering  from  two  types 
of  surface  roughness:  a  periodic  grating  and  a  surface  with  random  roughness  in  one 
dimension. 

For certain choices of grating periods and incident angles, a periodic grating can
induce coupling between incident electromagnetic waves and surface polaritons by creating
evanescent fields that travel parallel to the surface of the material. These evanescent fields are
confined  to  the  surface  of  the  material  and  can  have  frequencies  and  wavevectors 
corresponding  to  surface  polariton  modes.  On  the  other  hand,  a  grating  can  also  diffract  the 
incident  waves  into  radiative  states  that,  when  damping  is  present  in  the  material,  can  couple 
to  the  leaky  Brewster-like  modes. 
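The coupling condition described above can be illustrated with a short sketch. It assumes the standard first-order grating condition, k_x(n) = k0 sin(θ0) + 2πn/d, which is not written out in this chapter; the function name `grating_orders` and its parameters are hypothetical and chosen only for illustration. Orders with |k_x| > k0 are evanescent in the vacuum and are the ones that could match a surface polariton wavevector, while orders with |k_x| ≤ k0 are radiative.

```python
import math


def grating_orders(theta_deg, period_over_lambda, max_order=3):
    """Classify the diffracted orders of a periodic grating.

    All wavevectors are expressed in units of k0 = omega/c, so the
    grating condition k_x(n) = k0*sin(theta) + 2*pi*n/d becomes
    k_x(n)/k0 = sin(theta) + n / (d/lambda).
    """
    results = []
    for n in range(-max_order, max_order + 1):
        kx = math.sin(math.radians(theta_deg)) + n / period_over_lambda
        # |k_x| > k0 lies outside the light line: the field decays away
        # from the surface and travels parallel to it.
        kind = "evanescent" if abs(kx) > 1.0 else "radiative"
        results.append((n, kx, kind))
    return results


if __name__ == "__main__":
    # Illustrative values only: normal incidence, period of half a wavelength.
    for n, kx, kind in grating_orders(0.0, 0.5, max_order=1):
        print(f"order {n:+d}: k_x/k0 = {kx:+.3f} ({kind})")
```

Whether an evanescent order actually excites a surface mode depends on whether its frequency and k_x fall on the surface polariton dispersion curve, which this sketch does not compute.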

7.4  Scattering  from  randomly  rough  surfaces. 

Random  roughness  creates  a  "width"  in  wavelength  analogous  to  the  width  in 
frequency  created  by  material  damping.  As  the  surface  becomes  rougher,  the  incident 
wave  couples  strongly  to  surface  modes  over  a  greater  range  of  wavelengths.  The  surface 
polaritons have a greater density of states at short wavelengths, and increased roughness can
lead  to  stronger  couplings  by  allowing  the  incident  wave  to  interact  with  short  wavelength 
surface  polaritons.  The  density  of  states  of  the  surface  polariton  modes  is  also  nonreciprocal 
with  respect  to  propagation  direction,  so  increased  roughness  in  a  reflection  experiment  can 
also  enhance  nonreciprocal  reflectivity  changes. 
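The "width" in wavelength can be made concrete with a minimal sketch. It assumes a Gaussian height correlation function, ⟨ζ(x)ζ(x')⟩ = h² exp(-(x-x')²/σ²), a common model for one-dimensional random roughness but an assumption here, with h the rms height and σ the transverse correlation length. Its Fourier transform is g(k) = √π σ h² exp(-σ²k²/4), so a smaller σ (a rougher surface) spreads the coupling over a wider range of wavevectors. The function names are hypothetical.

```python
import math


def roughness_spectrum(k, h, sigma):
    """Fourier transform of the assumed Gaussian height correlation.

    g(k) = sqrt(pi) * sigma * h**2 * exp(-(sigma*k)**2 / 4)
    """
    return math.sqrt(math.pi) * sigma * h ** 2 * math.exp(-((sigma * k) ** 2) / 4.0)


def spectral_half_width(sigma):
    """Wavevector at which g(k) has fallen to 1/e of its k = 0 value."""
    return 2.0 / sigma


if __name__ == "__main__":
    h = 0.003  # rms height in units of c/Omega (value quoted for figure 6.10)
    for sigma in (0.1, 0.5):  # correlation lengths in units of c/Omega
        width = spectral_half_width(sigma)
        print(f"sigma = {sigma}: 1/e half-width = {width:.1f} (units of Omega/c), "
              f"g(0) = {roughness_spectrum(0.0, h, sigma):.3e}")
```

In this model the shorter correlation length gives a half-width five times larger, which is the sense in which the rougher surface reaches the short wavelength, high density of states part of the surface polariton branches.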

7.5  Future  extensions. 

The most natural extension for this work would be the experimental investigation of
the phenomena predicted by these calculations. Observation of the leaky modes would be
particularly interesting and the categorization of stable frequency regions for these modes, as
discussed in Chapter 4, could be used to gain information about the damping mechanisms in
the antiferromagnet. Observation of the leaky modes at different temperatures would be
particularly interesting since different types of damping have different temperature
dependences. If the leaky modes were primarily due to interactions within the spin system,
as represented phenomenologically by Landau damping terms, they should be
observable at very low temperatures where magnon-phonon interactions are negligible.
Conversely, if the leaky modes are due primarily to interactions involving the vibrational
modes of the lattice, as described phenomenologically with Bloch damping terms, then the
leaky modes should show a strong temperature dependence.

Further theoretical investigation of the leaky modes is also in order. While these
excitations  are  not  true  eigenmodes  of  the  antiferromagnetic  system,  the  electromagnetic 
Green's  functions  can  be  used  to  approximately  describe  coupling  to  these  waves.  It  remains 
to  clarify  and  understand  this  approximation.  Since  the  approximation  requires  exponentially 
increasing  waves  to  obtain  leaky  mode  peaks  from  the  Green's  functions,  it  might  be 
interesting  to  calculate  the  exact  Green's  functions  for  a  bounded  geometry  that  would  admit 
exponentially  growing  waves  as  physical  representations.  The  inclusion  of  a  ground  plane  in 
the  vacuum  outside  the  antiferromagnet  would  be  the  simplest  system  to  consider. 

There are numerous other ways to expand upon the theoretical work presented in
this manuscript. For example, the Green's functions could be calculated, at least
numerically, for arbitrary angles of propagation with respect to the applied field. Since the
coupling of external photons to bulk polariton modes is extremely sensitive to the wavevector
component of the incident wave parallel to the applied field (k_z), perhaps coupling incident
light to the surface excitations is also strongly dependent on k_z. This might be particularly
true for the leaky modes since they are often largely radiative in character. It would thus be
interesting to examine the profiles of the Green's function peaks for the surface excitations as
functions of k_z.

Finally,  there  remains  the  question  of  how  to  accurately  describe  the  scattering  of 
light  from  a  rough  surface.  An  important  enlargement  of  the  theory  would  be  to  include  into 
the  description  the  possibility  of  scattering  the  incident  light  into  different  polarization  states. 
This would involve calculating the k_z≠0 Green's functions, however, since magnetic fields
in the z direction are uncoupled from the other magnetic fields when k_z=0.




APPENDICES 


A.  Fresnel  Relations. 

The  amplitudes  of  the  driving  fields  are  written  in  terms  of  the  incident  wave's 
amplitude,  The  relations  are  derived  in  the  usual  manner  by  imposing  Maxwell's 
continuity  conditions  on  the  unperturbed  tangential  fields  and  the  tangential  fields. 
Using

$$k_0 = \frac{\omega}{c} \tag{A.1}$$

for the magnitude of the free space wavevector, and


(A.2) 

for the magnitude of the wavevector in the material, the amplitudes of the components of the
transmitted wave are

[expression illegible in the scanned source]   (A.3)

for the x component and

[expression illegible in the scanned source]   (A.4)

for the y component.

Finally, the ratio of the transmittance to the reflectance is found to be

T/R = [expression illegible in the scanned source]



B.  Green's  Functions 

When the source points are in the material (y'>0), the Green's functions satisfy
inhomogeneous  equations  for  y>0  (in  the  material)  given  by 


[expression illegible in the scanned source]   (B.1)

and homogeneous equations for y<0 (outside the material) given by

[expression illegible in the scanned source]   (B.2)

These  equations  can  be  uncoupled  and  solved  for  the  homogeneous  and  particular 
solutions  gjj,  just  as  in  the  case  where  y'<0.  Also,  one  can  derive  boundary  conditions,  in 
the  manner  described  in  chapter  5,  that  are  identical  to  those  of  equations  (5.34)  and  (5.43). 
With  the  definition 


C=-'y(iii(xsgn(y-y'>Hi2kx) 


(B.3) 


this  prescription  results  in  the  following  Green's  functions  for  y>0  and  y'>0: 


(B.4)

(B.5)

g_yy = [expression illegible in the scanned source]   (B.6)

g_yx = [expression illegible in the scanned source]   (B.7)

For y<0 and y'>0,

g_xx = [expression illegible in the scanned source]   (B.8)

(B.9)

(B.10)

(B.11)


Finally, as discussed in section 5.1, note that these forms are appropriate only for
the case where α and γ are real. When α or γ is imaginary, the appropriate forms are obtained by
letting α go to -iα or γ go to -iγ in the above expressions. Likewise, when damping is
present and α and γ are complex, the correct transformation is α to -iα* and γ to -iγ*.




C.  Landau-Lifshitz  damping  terms. 

The  form  of  the  Landau  damping  terms  which  appear  in  the  equations  of  motion  can 
be obtained from the free energy of the antiferromagnet. Using the definitions of the
effective  fields  acting  on  each  sublattice  (equations  2.2  and  2.3),  the  free  energy  is  written  as 
the  sum  of  the  free  energies  of  each  sublattice: 

$$F = -\mathbf{M}_a \cdot \mathbf{H}_a - \mathbf{M}_b \cdot \mathbf{H}_b \tag{C.1}$$

In the calculation of the susceptibilities only terms to first order in the transverse
magnetizations M_x and M_y were kept. The same approximation of small precession angles in
the free energies must be made to second order in M_x and M_y for consistency. A Taylor
expansion of M_z about M_x=0 and M_y=0 yields the relation

(C.2) 

for  the  z  component  of  the  A  sublattice  magnetization  and 

(C3) 

for  the  z  component  of  the  B  sublattice  magnetization.  M  is  the  saturation  magnetization. 
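The expansion referred to above can be sketched as follows. This is a reconstruction under stated assumptions, not the source's own equations (C.2) and (C.3), which did not survive in this copy: the magnitude of each sublattice magnetization is taken to be fixed at M, the A sublattice is taken to point along +z and the B sublattice along -z, and the superscripts a and b are notation introduced here.

```latex
% Assumption: |M^a| = |M^b| = M, with A along +z and B along -z.
\begin{align*}
M_z^{a} &= \sqrt{M^2 - (M_x^{a})^2 - (M_y^{a})^2}
        \;\approx\; M - \frac{(M_x^{a})^2 + (M_y^{a})^2}{2M}, \\
M_z^{b} &= -\sqrt{M^2 - (M_x^{b})^2 - (M_y^{b})^2}
        \;\approx\; -M + \frac{(M_x^{b})^2 + (M_y^{b})^2}{2M}.
\end{align*}
```

Both results follow from $\sqrt{1-\epsilon} \approx 1 - \epsilon/2$ with $\epsilon = (M_x^2 + M_y^2)/M^2$, so the leading correction to the z component is already second order in the transverse magnetizations.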

These expressions are substituted into the free energies and only terms to second
order in the transverse magnetizations are kept. In the small precession angle approximation,
there are no forces in the z direction. Since the gradient of the free energy with respect to M
gives the effective forces acting on the magnetizations, the gradient in the z direction
vanishes. The transverse gradient for the A sublattice is defined as

$$\nabla_a = \hat{x}\,\frac{\partial}{\partial M_x^a} + \hat{y}\,\frac{\partial}{\partial M_y^a} \tag{C.4}$$

The transverse gradient for the B sublattice, ∇_b, is defined similarly. Taking ∇_a F, one
finds:


X(Mj+My>f(H„+i^^-hy 

(C.5) 

hj  and  Hg  are  the  anisotropy  and  applied  fields,  respectively.  With  C.5,  the  following 
equality  can  be  verified: 


A 

+y 


v/=-Li^  (Rxh  j 

M 


(C.6) 

A  similar  expression  can  be  obtained  for  the  B  sublattice  magnetizations.  The  Bloch 
equations  of  motion  with  Landau  damping  can  thus  be  written 


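$$\dot{\mathbf{M}}_a = \gamma\,\mathbf{M}_a \times \mathbf{H}_a - \Lambda \nabla_a F \tag{C.7}$$

$$\dot{\mathbf{M}}_b = \gamma\,\mathbf{M}_b \times \mathbf{H}_b - \Lambda \nabla_b F \tag{C.8}$$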

SODIUM LIDAR STUDIES OF THE HORIZONTAL VARIABILITY OF
GRAVITY WAVES IN THE MESOSPHERE


BY

KANG HYON KWON

B.S., University of Illinois, 1984
M.S., University of Illinois, 1986


THESIS 

Submitted  in  partial  fulfillment  of  the  requirements 
for  the  degree  of  Doctor  of  Philosophy  in  Electrical  Engineering 
in  the  Graduate  College  of  the 
University  of  Illinois  at  Urbana-Champaign,  1989 


Urbana,  Illinois 


ABSTRACT 


A data analysis technique for determining gravity wave intrinsic parameters is presented.
The  intrinsic  parameters  include  the  horizontal  and  vertical  wavelengths,  period,  and  wave 
propagation  direction.  The  technique  involves  measuring  the  altitude  variations  of  the  wave 
induced  density  perturbations  in  the  mesospheric  Na  layer.  The  intrinsic  horizontal  wavelength  of 
a  wave  observed  with  an  airborne  lidar  in  November  1986  was  estimated  to  be  about  85  km,  and 
the  vertical  wavelength  was  4.1  km.  The  intrinsic  period  was  about  1.7  hours,  and  the 
propagation  direction  was  almost  due  south. 
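As a rough consistency check on the numbers quoted above, the sketch below computes the intrinsic horizontal phase speed and the buoyancy period they imply. It assumes the simplified gravity wave dispersion relation ω ≈ N k_x/m, valid only when the intrinsic frequency lies well between the inertial and buoyancy frequencies and the vertical wavelength is much shorter than the horizontal one. That relation and the function names are assumptions made for illustration and are not taken from the abstract.

```python
def phase_speed_m_per_s(wavelength_km, period_hr):
    """Horizontal phase speed: wavelength divided by period."""
    return wavelength_km * 1000.0 / (period_hr * 3600.0)


def implied_buoyancy_period_min(lambda_x_km, lambda_z_km, period_hr):
    """Buoyancy period implied by omega ~ N * k_x / m.

    With omega = 2*pi/T, N = 2*pi/T_B, k_x = 2*pi/lambda_x and
    m = 2*pi/lambda_z, the relation gives T_B = T * lambda_z / lambda_x.
    """
    return period_hr * 60.0 * lambda_z_km / lambda_x_km


if __name__ == "__main__":
    # Values quoted in the abstract for the November 1986 wave.
    lambda_x_km, lambda_z_km, period_hr = 85.0, 4.1, 1.7
    print("phase speed (m/s):", round(phase_speed_m_per_s(lambda_x_km, period_hr), 1))
    print("implied buoyancy period (min):",
          round(implied_buoyancy_period_min(lambda_x_km, lambda_z_km, period_hr), 1))
```

Under these assumptions the quoted parameters imply a phase speed of roughly 14 m/s and a buoyancy period of roughly 5 minutes.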

Kinetic  energy  horizontal  and  vertical  wavenumber  spectra  of  horizontal  winds  are  inferred 
from  Na  density  profiles  collected  with  the  airborne  lidar  during  two  flights  in  November  1986. 
The  two  flights  include  one  roundtrip  from  Denver,  Colorado  to  Springfield,  Illinois  and  another 
roundtrip from Denver to the Pacific Coast. The horizontal wavenumber spectra exhibited an
approximately  dependence at horizontal scales from 70 to 700 km, while the vertical
wavenumber spectra exhibited an approximately k_z^-2.7 dependence at vertical scales from 2 to 10
km. 
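A power-law dependence of this kind is usually estimated as the slope of a straight-line fit in log-log coordinates. The sketch below shows that procedure on synthetic data. The exponent used to generate the data, the sample scales, and the function name `fit_power_law_slope` are illustrative assumptions; no measured spectra are reproduced here.

```python
import math


def fit_power_law_slope(wavenumbers, powers):
    """Least-squares slope of log10(power) versus log10(wavenumber)."""
    xs = [math.log10(k) for k in wavenumbers]
    ys = [math.log10(p) for p in powers]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


if __name__ == "__main__":
    # Synthetic, noise-free spectrum over vertical scales of 2 to 10 km.
    # The exponent -2.7 is an assumed value chosen only for illustration.
    scales_km = [2.0, 3.0, 4.0, 5.0, 6.0, 8.0, 10.0]
    wavenumbers = [1.0 / s for s in scales_km]  # cycles per km
    powers = [k ** -2.7 for k in wavenumbers]
    print("fitted slope:", round(fit_power_law_slope(wavenumbers, powers), 3))
```

With real lidar spectra a noise floor would have to be estimated and subtracted before fitting, and the fit restricted to the range of scales where the signal dominates.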

In November 1986, joint lidar/radar observations were conducted. The lidar observations
include  the  airborne  observations  and  ground-based  observations  at  Broomfield  and  Denver, 
Colorado.  The  radar  observations  were  obtained  at  Platteville,  Colorado  with  an  ST  radar.  These 
joint  observations  revealed  that  waves  with  periods  of  approximately  6  hours  and  2  hours  were 
dominant  at  the  altitudes  that  correspond  to  the  bottomside  of  the  Na  layer. 

The characteristics of sporadic Na layers observed at Mauna Kea Observatory, Hawaii
(20°N, 155°W) and at Nordlysstasjonen, Svalbard, Norway (78°N, 16°E) are also described. The
layers observed in Hawaii formed either in the late evening or in the early morning. The mean
time of the maximum peak density of the early morning layers occurred about 6 hours after that of
the late evening layers. The mechanisms responsible for creating these layers appear to be related
to  diurnal  tides  and  sporadic  E  layers. 




DEDICATION 


To  my  father  and  mother 




ACKNOWLEDGMENT 


I  wish  to  express  my  sincere  appreciation  to  Professor  Chester  S.  Gardner  under  whose 
supervision  this  work  was  performed.  His  guidance,  friendship,  and  inspiration  have  been 
invaluable  to  me.  I  would  also  like  to  thank  Professor  C.  H.  Liu  for  many  helpful  suggestions 
during  the  course  of  my  academic  progress.  I  thank  Professor  Steve  Franke  for  his  comments 
and for serving on my doctoral committee, and I thank my fellow graduate students at the
Electro-optics Systems Laboratory for their encouragement and assistance.

I would like to specially thank my wife Casey for her unending support and
encouragement.




TABLE OF CONTENTS


1. INTRODUCTION

2. AIRBORNE SODIUM LIDAR MEASUREMENTS OF GRAVITY WAVE INTRINSIC PARAMETERS
2.1 Introduction
2.2 Estimation of Gravity Wave Intrinsic Parameters
2.3 Experimental Data
2.4 Summary

3. AIRBORNE SODIUM LIDAR OBSERVATIONS OF HORIZONTAL AND VERTICAL WAVENUMBER SPECTRA OF MESOPAUSE DENSITY AND WIND PERTURBATIONS
3.1 Introduction
3.2 Layer Density Response
3.3 Description of the Experiment
3.4 Results
3.5 Discussion
3.6 Summary

4. CORRELATIVE RADAR AND AIRBORNE SODIUM LIDAR OBSERVATIONS OF THE VERTICAL AND HORIZONTAL STRUCTURE OF GRAVITY WAVES AND TIDES NEAR THE MESOPAUSE
4.1 Introduction
4.2 Description of the Experiment
4.2.1 Radar
4.2.2 Lidar
4.3 Radar Observations
4.4 Ground-based Lidar Observations
4.5 Airborne Lidar Observations
4.6 Summary

5. LIDAR OBSERVATIONS OF SPORADIC SODIUM LAYERS AT MAUNA KEA OBSERVATORY, HAWAII
5.1 Introduction
5.2 Observations
5.3 Discussion
5.4 Summary

6. LIDAR OBSERVATIONS OF THE SODIUM LAYER AT SVALBARD, NORWAY
6.1 Overviews
6.2 Lidar Observations at Svalbard in July and September, 1986
6.3 Lidar Observations at Svalbard in November, 1987 and January, 1988

7. CONCLUSIONS AND RECOMMENDATIONS
7.1 Conclusions
7.2 Recommendations for Future Work

APPENDIX I. RMS ERRORS IN GRAVITY WAVE INTRINSIC PARAMETERS FOR AIRBORNE LIDAR OBSERVATIONS OVER A CIRCULAR FLIGHT PATH

APPENDIX II. RMS ERRORS IN GRAVITY WAVE INTRINSIC PARAMETERS FOR MULTIPLE GROUND-BASED LIDAR OBSERVATIONS

APPENDIX III. CHARACTERISTICS OF SPORADIC SODIUM LAYERS OBSERVED AT MAUNA KEA OBSERVATORY, HAWAII

REFERENCES

VITA


LIST OF TABLES


2.1 The Parameters of the UIUC Na Lidar System and NCAR Electra Aircraft

2.2 Summary of the Flights and System Performance

2.3 Intrinsic Parameters of the Dominant Wave Observed During the Westward Flight on November 17-18, 1986

3.1 Horizontal Velocities of the Maxima and Minima of the Centroid Height of the Na Layer Observed During the Eastward Flight on November 15-16, 1986

3.2 Monochromatic Wave Parameters Estimated from the Horizontal Wavenumber Spectra for the Eastward Flight on November 15-16, 1986

3.3 Summary of the Horizontal and Vertical Wavenumber Spectra Parameters Measured on the Eastward and Westward Flights

4.1 Summary of the Airborne and Ground-based Na Lidar Observations

5.1 Statistics of Na Sporadic Layers Observed at Mauna Kea, Hawaii

5.2 Comparison of Sporadic Na Layers Observed in the Late Evening and Early Morning at Mauna Kea, Hawaii

5.3 Characteristics of the Most Prominent Sporadic Na Layers Reported in Literature

6.1 Sodium Lidar Observation Times at Svalbard (1987-1988)

6.2 Sodium Layer Parameters Measured at Nordlysstasjonen (78°12'N, 15°15'E), Svalbard During the July and September 1987 Campaigns


LISTOFHGURES 

Figures  Page 

2. 1  The  relationship  between  the  horizontal  components  of  the  wave 

propagation  direction  and  background  atmospheric  wind .  9 

2.2  Ground  track  of  the  circular  flight  path  with  radius  R .  13 

2.3  Configuration  for  three  ground-based  lidars  located  at  the  comers  of  an 

equilateral  triangle  with  sides  of  length  R .  16 

2.4  Ground  tracks  of  the  three  flights  of  the  airborne  Na  lidar  experiment  in 

November,  1986.  The  flights  were  conducted  out  of  Denver,  Colorado .  20 

2.5  Sodium  density  profiles  collected  during  the  westbound  and  eastbound  legs 

of  the  westward  flight  on  November  17-18,  1986.  The  profiles  have  been 
filtered  vertically  with  a  cutoff  of  3  km  and  horizontally  with  a  cutoff  of  50 
km.  The  profiles  have  been  also  normalized  so  that  each  has  the  same 
column  abundance,  and  are  plotted  on  a  linear  scale .  22 

2.6  The  altitude  variations  of  a  local  Na  density  maximum  and  minimum 
measured  during  the  westward  flight  on  November  17-18, 1986.  Circles 
represent  the  altitudes  of  the  density  maximum,  and  crosses  represent  the 
altitudes  of  the  density  minimum.  The  solid  lines  represent  the  least- 

squares  fitted  altitudes  of  the  density  maximum  and  minimum .  24 

2.7  Average  vertical  wavenumber  power  spectrum  of  the  Na  density  profiles 

collected  from  0131  to  0138  MST  during  the  westward  flight  on  November 
17-18,  1986 .  25 

2.8  a)  Horizontal  wavenumber  spectra  computed  for  the  bottomside  and  topside 

of  the  Na  layer  measured  during  the  westbound  leg  of  the  westward  flight 
on  November  17-18, 1986.  The  data  were  filtered  vertically  with  a  cutoff 
of  1  km  before  the  spectra  were  computed .  29 


X 


Hgures  Page 

2.8  b)  Horizontal  wavenumber  spectra  computed  for  the  bottomside  and  topside 
of  the  Na  layer  measured  during  the  eastbound  leg  of  the  westward  flight 
on  November  17-18, 1986.  The  data  were  filtered  vertically  with  a  cutoff 
of  1  km  before  the  spectra  were  computed .  30 

3.1  A  photograph  of  the  NCAR  Electra  aircraft .  42 

3.2  A  photograph  of  the  interior  of  the  NCAR  Electra  showing  the  lidar 

installation .  43 

3 . 3  Sodium  density  profiles  collected  during  the  eastbound  leg  of  the  eastward 
flight  on  November  15, 1986.  The  profiles  were  filtered  vertically  with  a 
cutoff  of  4  km  and  horizontally  with  a  cutoff  of  70  km.  The  density 
profiles  were  also  normalized  so  that  each  had  the  same  column  abundance 

and  were  plotted  on  a  linear  scale .  45 

3.4  Vertical  and  longitudinal  variations  of  the  relative  density  perturbations 

computed  fix^m  the  Na  lidar  data  collected  during  the  eastbound  leg  of  the 
eastward  flight  on  November  15, 1986.  Before  the  data  were  computed, 
the  density  profiles  were  filtered  vertically  with  a  cutoff  of  1  km  and 
horizontally  with  a  cutoff  of  70  km.  Then  the  relative  density  perturbations 
were  computed  and  filtered  horizontally  with  a  cutoff  of  180  km .  46 

3.5  Horizontal wavenumber spectrum for the eastbound leg of the eastward
flight on November 15, 1986. The data were filtered vertically with a
cutoff of 1 km before the spectrum was computed. The straight line is a
linear regression fit which was used to estimate the spectral slope over
horizontal scales from 70 to 700 km .  47

3.6  Vertical wavenumber spectrum for the eastbound leg of the eastward
flight on November 15, 1986. The data were filtered horizontally with a
cutoff of 70 km before the spectrum was computed. Then the shot noise
level was estimated and subtracted from the spectrum before the spectral
slope was estimated. The straight line is a linear regression fit which was
used to estimate the spectral slope over vertical scales from 2 to 10 km .  49

3.7  Sodium  density  profiles  collected  during  the  westbound  leg  of  the  eastward 
flight  on  November  15-16, 1986.  The  profiles  were  processed  in  the  same 
manner  as  those  of  the  eastbound  leg  of  the  eastward  flight  plotted  in 

Figure  3.3 .  50 

3.8  Vertical and longitudinal variations of the relative density perturbations
computed from the Na lidar data collected during the westbound leg of the
eastward flight on November 15-16, 1986. To compute these relative
density perturbations, the Na profiles were first filtered vertically with a
cutoff of 1 km and horizontally with a cutoff of 70 km. Then the relative
density perturbations were computed and filtered horizontally with a cutoff
of 233 km .  51

3.9  Horizontal wavenumber spectrum for the westbound leg of the eastward
flight on November 15-16, 1986. The data were processed in the same
manner as the data of the eastbound leg of the eastward flight .  53

3.10  The  longitudinal  variations  of  the  centroid  height  of  the  Na  layer  observed 

during  the  eastward  flight  on  November  15-16,  1986 .  54 

3.11  The horizontal wavelengths of the waves observed with the ground-based
(after Gardner and Voelz, 1987) and airborne Na lidars are plotted versus
period. Crosses represent the ground-based observations obtained at
Urbana, Illinois during the winter seasons from 1980 to 1986. Circles
represent the intrinsic zonal wavelengths and intrinsic periods measured
during the eastward flight on November 15-16, 1986. The straight line is
a maximum likelihood linear regression fit which was used to estimate the
slope for the ground-based observations .  62

3.12  The kinetic energy of quasi-monochromatic gravity waves plotted versus
temporal frequency. Crosses represent the ground-based Na lidar
observations obtained at Urbana, Illinois during the winter seasons from
1980 to 1986 (after Gardner and Voelz, 1987). Circles represent the
kinetic energy distribution for the quasi-monochromatic waves measured
during the eastward flight on November 15-16, 1986. The shaded circle
is the data measured during the westbound leg, and the open circles are the
data measured during the eastbound leg. The straight line represents a
maximum likelihood linear regression fit for the ground-based
measurements .  63

3.13  Vertical  wavenumber  spectrum  for  the  westbound  leg  of  the  eastward 

flight  on  November  15-16,  1986 .  65 

3.14 a) Horizontal wavenumber spectrum for the eastbound leg of the westward

flight  on  November  17-18,  1986 .  66 

3.14  b)  Horizontal  wavenumber  spectrum  for  the  westbound  leg  of  the  westward 

flight  on  November  17-18,  1986 .  67 

3.15  a)  Vertical  wavenumber  spectrum  for  the  eastbound  leg  of  the  westward  flight 

on  November  17-18,  1986 .  68 

3.15  b)  Vertical  wavenumber  spectrum  for  the  westbound  leg  of  the  westward  flight 

on  November  17-18,  1986 .  69 

3.16  The rms horizontal wind velocities inferred from the airborne Na lidar data
collected during the eastward (November 15-16, 1986) and westward
(November 17-18, 1986) flights .  70


3.17  The rms horizontal wind velocities inferred from the airborne Na lidar data,
and from the ground-based Na lidar data obtained at Mauna Kea
Observatory (MKO), Hawaii (20°N, 155°W), NCAR in Broomfield,
Colorado (40°N, 105°W), UIUC, Illinois (40°N, 88°W), and Goddard
Space Flight Center (GSFC), Maryland (39°N, 78°W). The dots and lines
for the measurements at MKO, NCAR, and GSFC indicate the average
values and the ranges of the measured values, respectively. The dot for
the UIUC data represents a regression fitted value for the middle of
November. The regression fit was performed over the data obtained from
1984 to 1986. The error bar for the UIUC data indicates the rms difference
between the regression fit and the measured values .  72

3.18  a)  Horizontal  wavenumber  spectra  computed  for  the  bottomside  and  topside 
of  the  Na  layer  measured  during  the  eastbound  leg  of  the  eastward  flight 

on  November  15-16,  1986 .  75 

3.18 b) Horizontal wavenumber spectra computed for the bottomside and topside
of  the  Na  layer  measured  during  the  westbound  leg  of  the  eastward  flight 

on  November  15-16,  1986 .  76 

3.18 c) Horizontal wavenumber spectra computed for the bottomside and topside
of  the  Na  layer  measured  during  the  eastbound  leg  of  the  westward  flight 
on  November  17-18,  1986 .  77 


3.18 d) Horizontal wavenumber spectra computed for the bottomside and topside
of  the  Na  layer  measured  during  the  westbound  leg  of  the  westward  flight 

on  November  17-18,  1986 .  78 

4.1  Vertical profiles of a) the meridional component and b) the zonal
component of average background wind velocity measured with Platteville
ST radar during the period from November 4 to 20, 1986. The data were
provided by the University of Colorado .  88

4.2  Diurnal variation of the horizontal wind perturbations measured with the
Platteville ST radar during the period from November 4 to 20, 1986. The
boxes represent the hourly velocity measurements, and the solid curve
represents a sinusoidal least-squares fit. The data were provided by the
University of Colorado .  89

4.3  Vertical profiles of a) the meridional component and b) the zonal
component of the 6-hour period horizontal winds. The data were provided
by the University of Colorado .  90

4.4  Sodium  density  profiles  measured  on  the  night  of  November  19-20, 1986. 

The  profiles  have  been  filtered  vertically  with  a  cutoff  of  3  km  and 
temporally  with  a  cutoff  of  50  min.  The  profiles  have  been  normalized  so 
that  each  has  the  same  column  abundance,  and  are  plotted  on  a  linear  scale 


at  10  min  intervals .  92 

4.5  Temporal  variations  of  the  layer  centroid  height,  rms  width,  and  column 

abundance  measured  on  November  19-20,  1986 .  93 

4.6  a)  Temporal  frequency  spectrum  computed  for  the  whole  layer  from  data 

collected  on  November  19-20, 1986.  The  Na  density  profiles  were 


vertically  filtered  with  a  cutoff  of  1  km  before  the  spectra  were  computed. 

The  straight  line  is  a  linear  regression  fit  which  was  used  to  estimate  the 

spectral  slopes  over  temporal  scales  from  30  to  360  min .  95 

4.6 b) Temporal frequency spectra computed for the layer bottomside and topside
from data collected on November 19-20, 1986. The Na density profiles
were vertically filtered with a cutoff of 1 km before the spectra were
computed. The straight lines are linear regression fits which were used to
estimate the spectral slopes over temporal scales from 30 to 360 min .  96

4.7  Relative temporal variations of the Na density from 81 to 92 km measured
on November 19-20, 1986. The Na density profiles were vertically filtered
with a cutoff of 5 km and temporally with a cutoff of 90 min. The diagonal
lines indicate apparent vertical phase progressions. The estimated phase
velocity is 1.8 m s$^{-1}$ .  97

4.8  The vertical wind velocity estimated at the altitude of 82 km. The Na
density profiles were initially filtered vertically with a cutoff of 5 km and
temporally with a cutoff of 60 min .  99

4.9  The  vertical  wind  velocity  estimated  at  the  altitude  of  101  km.  The  Na 

density  profiles  were  initially  filtered  vertically  with  a  cutoff  of  5  km  and 
temporally  with  a  cutoff  of  60  min .  100 

4.10  The  average  vertical  wavenumber  power  spectrum  computed  for  the  Na 
density  profiles  collected  from  2300  to  0100  MST  on  November  19-20, 

1986 .  101 

4.11  The rms horizontal wind velocity inferred from the Na density profiles
collected on November 19-20, 1986. The profiles were initially filtered
vertically with a cutoff of 1 km and temporally with a cutoff of 20 min .  102

4.12  Temporal  variations  of  the  altitudes  of  local  Na  density  maxima  measured 

on  November  19-20, 1986.  The  Na  density  profiles  were  initially  filtered 
vertically  with  a  cutoff  of  3  km  and  temporally  with  a  cutoff  of  50  min .  105 

4.13  The average vertical wavenumber power spectrum computed for the Na
density profiles collected from 0300 to 0625 MST on November 20, 1986 .  107



4.14  Vertical wavenumber spectrum measured on November 19-20, 1986. The
Na density profiles were initially filtered temporally with a cutoff of 20 min.
The straight line is a linear regression fit which was used to estimate the
spectral slope over vertical scales ranging from 2 to 10 km .  108

4.15 a) Longitudinal variations of the rms width of the Na layer observed during

the  eastward  flight  on  November  15-16,  1986 .  110 

4.15 b) Longitudinal variations of the column abundance of the Na layer observed
during the eastward flight on November 15-16, 1986 .  111

4.16  The average vertical wavenumber power spectrum computed for the Na
density profiles collected during the westbound leg of the eastward flight in
the longitude range from 96°W to 102°W .  113

5.1  Sodium density profiles measured at a) 2133 LST, b) 2218 LST, and c)

2224  LST  at  Mauna  Kea  Observatory  on  January  21,  1987 .  118 

5.2  Sodium  density  profile  measured  at  2306  LST  on  January  21, 1987  at 

Mauna  Kea  Observatory .  120 

5.3  Temporal  variation  of  the  altitude  of  the  dominant  sporadic  Na  layer 

observed  on  January  21-22, 1987  at  Mauna  Kea  Observatory.  The  straight 
line  is  a  linear  regression  fit  which  was  used  to  estimate  the  average  vertical 
velocity .  121 

5.4  Sodium  density  profile  measured  at  0407  LST  on  January  22,  1987  at 

Mauna  Kea  Observatory . 122 

5.5  Temporal variation of the density at the peak of the dominant sporadic Na

layer  observed  on  January  21-22, 1987  at  Mauna  Kea  Observatory .  123 

5.6  Temporal variations of the widths of the dominant sporadic Na layer

observed  on  January  21-22,  1987  at  Mauna  Kea  Observatory .  125 



5.7  Temporal  variation  of  the  column  abundance  measured  on  January  21-22, 

1987  at  Mauna  Kea  Observatory .  126 

5.8  Temporal  variations  of  the  peak  density  and  altitude  of  the  high  altitude 

sporadic  Na  layer  observed  on  January  21, 1987  at  Mauna  Kea 
Observatory .  127 

5.9  Temporal  variation  of  the  altitude  of  the  short-lived  sporadic  Na  layer 

observed  on  January  21,  1987  at  Mauna  Kea  Observatory.  Crosses 
represent  the  four  measurement  points .  129 

5.10  Temporal variations of the altitudes of the sporadic Na layers measured
from January 18 to 22, 1987 at Mauna Kea Observatory. Dots represent
the times and altitudes of the maximum peak densities. Squares, triangles,
circles, and diamonds represent starting times and altitudes. The sporadic
layers are numbered in accordance with Appendix 3 .  130

5.11  Sodium density profiles measured at 0256 LST on January 18, 0301 LST
on January 20, and 0255 LST on January 22, 1987 .  136


5.12  Temporal  variations  of  the  altitudes  of  local  Na  density  maxima  on  January 

21-22,  1987  at  Mauna  Kea  Observatory.  The  measurement  times  of  the 
density  profiles  of  Figures  5.1b),  5.1c),  5.2,  and  5.4  are  marked  on  the 
bottom  of  this  figure .  138 

5.13  Temporal variations of the altitudes of local Na density maxima and minima
from 2100 to 2330 LST on January 21, 1987 at Mauna Kea Observatory.
The solid curves represent the altitudes of the sporadic layers, the dashed
curves the local density maxima, and the dotted curves the local density
minima. The sporadic layers are numbered in accordance with Table 5.2.
The measurement times of the density profiles of Figures 5.1a), 5.1b),
5.1c), and 5.2 are marked on the bottom of this figure .  140


6.1  Seasonal variations of Na column abundance measured a) at Svalbard,
Norway (78°N) and b) at Urbana, Illinois (40°N). Symbols denote
average values for the observation period, while lines denote range of
values .  145

6.2  Seasonal variations of Na layer centroid height measured a) at Svalbard,
Norway (78°N) and b) at Urbana, Illinois (40°N). Symbols denote
average values for the observation period, while lines denote range of
values .  146

6.3  Seasonal variations of Na layer rms width measured a) at Svalbard,
Norway (78°N) and b) at Urbana, Illinois (40°N). Symbols denote
average values for the observation period, while lines denote range of
values .  148

6.4  Average  Na  density  profile  above  Nordlysstasjonen,  Svalbard  during  the 

period  July  10-15, 1987.  The  data  were  smoothed  using  modified 
Hamming  window  with  FWHM  =  6  km .  150 

6.5  Sodium density profile above Nordlysstasjonen, Svalbard at 2152 UT
September 7, 1987. The integration period was 14 min, and the data were
spatially smoothed using a low-pass filter with a cutoff frequency of 0.6
km$^{-1}$ .  151

6.6  Sodium density profile above Nordlysstasjonen, Svalbard at midnight
September 9, 1987. The integration period was 40 min, and the data were
spatially smoothed using a low-pass filter with a cutoff frequency of 0.6
km$^{-1}$.

6.7  a) Temporal variations and b) temporal power spectrum of the Na column
abundance measured during the 57-hour period starting at 1500 LST on
November 7, 1987 .  156

6.8  a)  Temporal  variations  and  b)  temporal  power  spectrum  of  the  Na  layer 
centroid  height  measured  during  the  57-hour  period  starting  at  1500  LST 

on  November  7,  1987 .  157 

6.9  a)  Temporal  variations  and  b)  temporal  power  spectrum  of  the  Na  rms 

width  measured  during  the  57-hour  period  starting  at  1500  LST  on 
November 7, 1987 .  158

6.10  a)  Temporal  variations  and  b)  temporal  power  spectrum  of  the  vertical 


winds  at  the  altitude  of  98  km.  The  vertical  winds  were  inferred  from  the 
temporal  variations  of  the  Na  density  gradients  on  the  layer  topside 
measured  during  the  57-hour  period  starting  at  1500  LST  on  November  7, 

1987 .  159 


6.11 a) Temporal variations and b) temporal power spectrum of the Na column
abundance  measured  during  the  72-hour  period  starting  at  local  midnight 

on  January  10,  1988 .  160 

6.12  a)  Temporal  variations  and  b)  temporal  power  spectrum  of  the  Na  centroid 

height  measured  during  the  72-hour  period  starting  at  local  midnight  on 
January 10, 1988 .  161

6.13  a)  Temporal  variations  and  b)  temporal  power  spectrum  of  the  Na  rms 

width  measured  during  the  72-hour  period  starting  at  local  midnight  on 
January  10,  1988 .  162 

6.14 a) Temporal variations and b) temporal power spectrum of the vertical
winds at the altitude of 98 km. The vertical winds were inferred from the
temporal variations of the Na density gradients on the layer topside
measured during the 72-hour period starting at local midnight on January
10, 1988 .  163

6.15  Examples of sporadic Na layers observed at Svalbard a) on November 9,

1987  and  b)  on  January  10,  1988 .  165 

6.16  a) A bifurcated sporadic Na layer observed at Svalbard on November 9,
1988, and b) the temporal evolution of the sporadic Na layer .  166


1.  INTRODUCTION 


The atmospheric Na layer is generally confined to an altitude region between 80 and 110
km, which is the transition region from the mesosphere to the thermosphere. The mesopause
characterizes the boundary between the mesosphere and thermosphere, and is typically the
coldest region of the atmosphere. The altitude of the mesopause is approximately 90-95 km, and the
maximum Na density occurs near the same altitude. Because this region is relatively inaccessible
to in-situ measurement techniques, it has remained the least understood region of the earth's
atmosphere. For the past two decades, Na lidar techniques have been developed to explore this
region. In recent years, these Na lidar techniques have matured to the extent that they are now
providing very important information on the dynamics and temperature structure of this region.

Studies of the Na layer began when Slipher [1929] discovered nighttime spectral
emission at the resonance wavelength of the Na D2 line of 589 nm, which is the result of the
relaxation of the excited atmospheric Na atoms to neutral atoms. In the 1950s and 1960s,
observations of resonantly scattered sunlight at the Na wavelength defined the diurnal and
seasonal variations in column abundance [Blamont and Donahue, 1961; Gadsen and Purdy,
1970; Burnett et al., 1975], and rocketborne dayglow measurements discovered the sharp upper
and lower boundaries of the layer [Hunten and Wallace, 1967; Donahue and Meier, 1967].

Lidar observations of the Na layer were first made in England following the invention of
the tunable dye laser in the late 1960s [Bowman et al., 1969]. Because of their excellent temporal
and vertical resolution, lidar observations have revealed detailed information about the layer. The
vertical density profile of the layer is approximately Gaussian in shape, with
a peak density of $10^3$-$10^4$ cm$^{-3}$ near 90-95 km and a width of about 10 km Full Width at Half
Maximum (FWHM). The major source of Na is now believed to be meteoric ablation, and the
major sink mechanism is chemical reaction on the bottomside, including the neutral reaction Na +
O$_2$ + M → NaO$_2$ + M [Swider, 1985].




The first lidar study of the seasonal variations of the Na layer was made by Gibson and
Sandford [1971] at Winkfield, England (51°N, 1°W). Similar studies have also been conducted
by Megie and Blamont [1977] at Haute Provence, France (44°N, 6°E), by Simonich et al. [1979]
at Sao Paulo, Brazil (23°S, 46°W), by Gardner et al. [1986] at Urbana, IL (40°N, 88°W), and by
Tilgner and von Zahn [1988] at Andoya, Norway (69°N, 16°E). The Na column abundance at
mid-latitudes in the Northern Hemisphere varies from a summer minimum of about $3 \times 10^9$
cm$^{-2}$ to a winter maximum of about $10^{10}$ cm$^{-2}$ in December and January. The seasonal and
geographical variations in Na abundance at mid- and low latitudes are now believed to be related
to changes in the mesopause temperature that affect the reaction rates of the main chemical loss
processes for Na [Swider, 1985; Jegou et al., 1985b].

High resolution temperature profiles have also been obtained in recent years by lidar
probing of the hyperfine structure of the Na D2 line [Fricke and von Zahn, 1985; von Zahn and
Neuber, 1987]. The Na resonance line is actually a Doppler broadened doublet whose shape is a
strong function of temperature. Temperature can be inferred by using a narrowband frequency
scanning lidar to measure the Na line shape.
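
The temperature dependence of the line shape can be illustrated with a short numerical sketch. It is not the retrieval used in the cited studies; it only evaluates the textbook thermal Doppler relation for a single spectral component, under the assumption that the rms Doppler width is the one-dimensional thermal speed divided by the wavelength. The function names and the rounded D2 wavelength are illustrative choices, not quantities taken from this thesis.

```python
import math

K_B = 1.380649e-23              # Boltzmann constant, J/K
AMU = 1.66053907e-27            # atomic mass unit, kg
NA_MASS = 22.98977 * AMU        # mass of one sodium atom, kg
NA_D2_WAVELENGTH = 589.16e-9    # approximate Na D2 wavelength, m (assumed value)


def doppler_rms_width_hz(temperature_k):
    """RMS Doppler width (Hz) of one Na D2 component at the given temperature."""
    # One-dimensional rms thermal speed of the Na atoms along the line of sight.
    thermal_speed = math.sqrt(K_B * temperature_k / NA_MASS)
    # A line-of-sight speed v shifts the resonance frequency by v / wavelength.
    return thermal_speed / NA_D2_WAVELENGTH


def temperature_from_rms_width(rms_width_hz):
    """Invert the relation above: temperature (K) implied by an rms width (Hz)."""
    return NA_MASS * (rms_width_hz * NA_D2_WAVELENGTH) ** 2 / K_B


if __name__ == "__main__":
    for temperature in (150.0, 200.0, 250.0):
        width_mhz = doppler_rms_width_hz(temperature) / 1.0e6
        print(f"T = {temperature:5.1f} K  ->  rms Doppler width = {width_mhz:6.1f} MHz")
```

Because the width scales as the square root of temperature, a measured line shape constrains the temperature only if the lidar linewidth is narrow compared with the Doppler width, which is why a narrowband frequency scanning lidar is required.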

Gravity  waves  and  tides  are  now  widely  recognized  to  play  a  major  role  in  determining 
the  large-scale  circulation  and  structure  of  the  middle  atmosphere.  Gravity  waves  are  believed  to 
be  the  major  source  of  energy  transport  between  the  lower  and  upper  atmosphere.  Turbulence 
caused  by  breaking  waves  has  a  significant  influence  on  the  transport  of  minor  species  and  on  the 
thermal  and  density  structure  of  the  mesosphere  and  the  thermosphere  [Lindzen,  1981;  Holton, 
1982 and 1983; Fritts et al., 1984; Fritts, 1984].

The Na layer has proven to be an excellent tracer of wave motions. Early lidar
measurements of the Na profile revealed wavelike perturbations that were attributed to acoustic
gravity waves [Rowlett et al., 1978; Juramy et al., 1981]. But it was not until the work of Chiu
and Ching [1978] and Gardner and Shelton [1985] that the interaction of gravity waves with the
layer was sufficiently well understood that the wave parameters could be inferred from lidar
measurements of the Na profile. The most extensive lidar study of gravity wave events was
recently published by Gardner and Voelz [1987]. During 34 nights of measurements at Urbana,
IL between December 1980 and May 1986, a total of 171 monochromatic gravity waves were
observed in the Na layer. From these measurements, Gardner and Voelz [1987] found
surprisingly systematic relationships between horizontal and vertical wavelengths and the
observed periods of the waves. Studies of tides using Na lidars have also been reported by
Batista et al. [1985] and Kwon et al. [1987].

Because of the influence of the background atmospheric wind field, wave parameters which
are  estimated  from  most  ground-based  lidar  observations  are  not  the  intrinsic  parameters.  In 
addition,  because  most  lidar  observations  have  been  made  at  fixed  elevation  angles,  the  studies  of 
gravity  waves  and  tides  are  usually  limited  to  exploring  only  the  vertical  and  temporal  structures 
of  the  waves.  The  horizontal  structure  of  the  waves  is  then  inferred  from  the  vertical  and 
temporal  observations.  Also  very  little  is  known  about  the  geographical  distribution  of  wave 
activity. 

In order to study the geographical distribution and horizontal structure of the waves, the
University of Illinois at Urbana-Champaign (UIUC) group conducted several Na lidar
campaigns from 1986 to 1988. The campaigns include an airborne campaign which was
conducted in November, 1986 with the support of the National Center for Atmospheric Research
- Research Aviation Facility (NCAR-RAF) in Broomfield, Colorado. A total of three flights
were made over the Great Plains, Rocky Mountains, and Pacific Coast. Additional campaigns
were conducted at Broomfield, Colorado in November, 1986, at Mauna Kea Observatory,
Hawaii in January, 1987, and at Nordlysstasjonen, Svalbard, Norway from June, 1987 to April,
1988. In order to process the airborne data, new data analysis techniques have also been
developed. In this thesis, the new data analysis techniques and the results of the campaigns will
be presented.




In Chapter 2, a data analysis technique for determining gravity wave intrinsic parameters
will be presented. The intrinsic parameters include the horizontal and vertical wavelengths,
period, and wave propagation direction. The technique is successfully applied to the airborne
data collected during a roundtrip flight from Denver, Colorado to the Pacific Coast in November,
1986.

In  Chapter  3,  a  method  is  presented  for  estimating  kinetic  energy  horizontal  and  vertical 
wavenumber  spectra  of  horizontal  winds  from  the  airborne  data.  These  spectra  are  compared 
with spectra obtained from ground-based lidar, radar, shuttle re-entry, and Global Atmospheric
Sampling  Program  observations.  The  rms  horizontal  wind  velocities  inferred  from  the  airborne 
data  are  also  compared  with  those  inferred  from  ground-based  lidar  observations  conducted  in 
Hawaii,  Colorado,  Illinois,  and  Maryland. 

In  November,  1986,  the  University  of  Colorado  group  operated  an  ST  radar  in 
Platteville,  Colorado  in  conjunction  with  the  airborne  lidar  observations  and  ground-based  lidar 
observations  at  Broomfield  and  Denver,  Colorado.  The  results  of  the  joint  radar/lidar 
observations  are  presented  in  Chapter  4. 

In January, 1987, the UIUC group installed and operated the Na lidar at the low-latitude
site of Mauna Kea Observatory (20°N, 155°W), Hawaii. The Na layer occasionally exhibited
sporadic developments of very dense narrow Na layers. The characteristics of these sporadic Na
layers will be discussed in Chapter 5. The characteristics of these layers are also compared with
the characteristics of similar layers observed at Sao Paulo, Brazil (23°S, 46°W) by Clemesha et
al. [1978] and at Andoya, Norway (69°N, 16°E) by von Zahn and Hansen [1988].

From July 1987 to April 1988, a total of five Na lidar campaigns were conducted at
Nordlysstasjonen, Svalbard, Norway (78°N, 15°E). The characteristics of the Na layer observed
at this high latitude site of Nordlysstasjonen are discussed in Chapter 6.




2.  AIRBORNE SODIUM LIDAR MEASUREMENTS OF GRAVITY
WAVE INTRINSIC PARAMETERS

2.1  Introduction 

For almost 20 years, resonance fluorescence lidar systems have been used to study the
mesospheric metals. Because of its relatively high density and large resonant backscattering
cross section, Na is the easiest metallic species to measure with lidar techniques and has been
explored extensively since the late 1960s. The Na layer is generally confined to the region
between 80 and 110 km with a peak near the mesopause at 90-95 km, where the density ranges
from about $10^3$ to $10^4$ cm$^{-3}$. The layer is an excellent tracer of wave motions, and Na lidar
observations are now making important contributions to the understanding of gravity wave and
tidal dynamics in the mesopause region [Batista et al., 1985; Gardner and Voelz, 1987; Kwon et
al., 1987]. Recent lidar-based studies of gravity waves have focused on characterizing quasi-
monochromatic waves. For example, by analyzing a total of 171 quasi-monochromatic waves
observed at Urbana, Illinois, Gardner and Voelz [1987] found surprisingly systematic
relationships between horizontal and vertical wavelengths and the observed periods of the waves.
Several radar-based studies of quasi-monochromatic waves have also been reported by Meek et
al. [1985], Reid and Vincent [1987], and Manson and Meek [1988].

Because of the influence of the background atmospheric wind field, wave parameters
which are estimated from most ground-based lidar observations are not the intrinsic parameters.
In order to measure the intrinsic wave parameters and to explore the horizontal structure of
gravity waves, the UIUC group has conducted several airborne Na lidar experiments. The first
airborne observations were conducted in March 1983 on a single roundtrip flight from NASA
Wallops Flight Facility to Albany, New York [Segal et al., 1984]. This experiment
demonstrated the feasibility of airborne measurements and revealed evidence of wave induced
horizontal structure over the 650 km flight path. In November of 1986, the UIUC group
conducted a second airborne Na lidar campaign with the support of the NCAR-RAF in
Broomfield, Colorado. A total of three flights were made over the Great Plains, Rocky
Mountains, and Pacific Coast. The flights were conducted out of Stapleton International Airport
in Denver, Colorado using the NCAR Electra aircraft.

In this chapter, a new data analysis technique for determining gravity wave intrinsic
parameters including wave propagation direction will be presented. In Section 2.2, the
theoretical basis for estimating the intrinsic wave parameters is discussed. Several examples for
airborne lidar and multiple ground-based lidar experiments using this technique are also
presented. In Section 2.3, the theoretical results are successfully applied to airborne Na lidar
data obtained during a roundtrip flight from Denver to the Pacific Coast on November 17-18,
1986.

2.2  Estimation  of  Gravity  Wave  Intrinsic  Parameters 

Recently, Gardner and Shelton [1985] and Gardner and Voelz [1987] derived
expressions for the density response of the atmospheric Na layer perturbed by a monochromatic
gravity wave, and have used the expressions to determine the wave parameters. By retaining
only the fundamental and first order perturbation terms, Gardner and Voelz [1987] have shown
the density response of the Na layer perturbed by a wave to be

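$$n_s(\mathbf{r},t) \approx n_0(\mathbf{r},t) - \frac{A e^{\beta z}}{\gamma - 1}\left[ n_0(\mathbf{r},t) + \gamma H \frac{\partial n_0(\mathbf{r},t)}{\partial z} \right] \cos\phi(\mathbf{r},t) \qquad (2.1)$$

where $n_s(\mathbf{r},t)$ = perturbed Na density,

$n_0(\mathbf{r},t)$ = Na density in the absence of the wave activity,

$\gamma$ = ratio of specific heats ($\approx$ 1.4),

$H$ = atmospheric scale height ($\approx$ 6 km),

$A e^{\beta z}$ = amplitude of the atmospheric density perturbations due to the gravity
wave,

$\beta$ = amplitude growth factor,

$\phi(\mathbf{r},t)$ = wave phase,

$\mathbf{r} = x\hat{\mathbf{x}} + y\hat{\mathbf{y}} + z\hat{\mathbf{z}}$ = position vector,

$x$ = east-west component of the geocentric coordinates,

$y$ = north-south component of the geocentric coordinates, and

$z$ = vertical component of the geocentric coordinates.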
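
As a rough numerical illustration of the first-order layer response, the sketch below evaluates the perturbed density for an idealized Gaussian layer. It assumes the response has the form $n_s = n_0 - \frac{A e^{\beta z}}{\gamma - 1}\left[n_0 + \gamma H\,\partial n_0/\partial z\right]\cos\phi$, with $\gamma$ and $H$ set to the approximate values quoted above. The layer peak density, centroid height, rms width (about 10 km FWHM), wave amplitude, and growth factor are hypothetical values chosen only for illustration, and all function names are invented for this example.

```python
import math

GAMMA = 1.4   # ratio of specific heats (approximate value quoted in the text)
H_KM = 6.0    # atmospheric scale height in km (approximate value quoted in the text)


def gaussian_layer(z_km, peak=5.0e3, centroid_km=92.0, rms_width_km=4.25):
    """Unperturbed Na density (cm^-3) for a hypothetical Gaussian layer."""
    return peak * math.exp(-0.5 * ((z_km - centroid_km) / rms_width_km) ** 2)


def gaussian_layer_gradient(z_km, peak=5.0e3, centroid_km=92.0, rms_width_km=4.25):
    """Vertical gradient dn0/dz (cm^-3 per km) of the same Gaussian layer."""
    n0 = gaussian_layer(z_km, peak, centroid_km, rms_width_km)
    return -(z_km - centroid_km) / rms_width_km ** 2 * n0


def perturbed_density(z_km, phase_rad, amplitude=0.02, beta_per_km=0.0):
    """First-order perturbed Na density at altitude z_km for a given wave phase.

    amplitude and beta_per_km are the assumed wave amplitude A and growth
    factor beta; beta = 0 means no amplitude growth with altitude.
    """
    n0 = gaussian_layer(z_km)
    dn0_dz = gaussian_layer_gradient(z_km)
    wave_amplitude = amplitude * math.exp(beta_per_km * z_km)
    response = n0 + GAMMA * H_KM * dn0_dz
    return n0 - wave_amplitude / (GAMMA - 1.0) * response * math.cos(phase_rad)


if __name__ == "__main__":
    for z in (85.0, 92.0, 100.0):
        n0 = gaussian_layer(z)
        ns = perturbed_density(z, phase_rad=0.0)
        print(f"z = {z:5.1f} km  n0 = {n0:8.1f}  ns = {ns:8.1f}  ns/n0 = {ns / n0:6.3f}")
```

In this sketch the sign of the perturbation reverses between the bottomside, where the gradient term is positive, and the upper topside, where the steep negative gradient dominates the bracketed response.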

The first term on the right-hand side of Equation (2.1) is the unperturbed layer profile,
and the second term is the first-order perturbation term. When the wave phase, $\phi(\mathbf{r},t)$, is either 0
or $\pi$, the magnitude of the perturbation term reaches a maximum, and the perturbations result in a
local maximum or minimum in the Na density profile. The wave phase can be written as

0(Lt)  =  cot  +  i*!:  -k*vbt  (2.2) 

where  co  =  wave  frequency. 

Is  =  +  ky^  +  kjZ  =  wavenumber  vector,  and 

Yb  =  Vbx^  +  Vby9  +  VjjjS  =  background  atmospheric  wind  velocity. 

The  wave  phase  is  a  function  of  timc(t),  horizontal  position  (x,y),  and  altitude(z).  Hence,  the 
altimde  of  a  Na  density  maximum  or  miiumum  varies  as  a  function  of  time  and  location.  By 
measuring  the  times,  horizontal  positions,  and  altitudes  of  a  density  maximum  or  minimum,  the 
intrinsic  wave  parameters  including  O)  and  k  can  be  determined.  Equation  (2.2)  can  be 
rearranged  so  that  the  altitude  variation  of  a  density  maximum  or  minimum  can  be  written  as  a 
function  of  time  and  horizontal  position. 


$$z = a_1 t + a_2 x + a_3 y + a_4 \tag{2.3}$$

where

$$a_1 = -\frac{\lambda_z}{T} + \frac{\lambda_z}{\lambda_h} V_{bh}\cos(\alpha_b - \alpha_w) + V_{bz} \tag{2.4}$$

$$a_2 = -\frac{\lambda_z}{\lambda_h}\sin\alpha_w \tag{2.5}$$

$$a_3 = -\frac{\lambda_z}{\lambda_h}\cos\alpha_w \tag{2.6}$$

$$a_4 = \frac{\lambda_z}{2\pi}\phi_0 \tag{2.7}$$

and   $\lambda_z$ = intrinsic vertical wavelength of the gravity wave,
      $\lambda_h$ = intrinsic horizontal wavelength of the gravity wave,
      $T$ = intrinsic wave period,
      $V_{bh}$ = horizontal background wind velocity amplitude,
      $V_{bz}$ = vertical background wind velocity amplitude,
      $\alpha_w$ = azimuth angle of the horizontal propagation direction of the gravity wave,
      $\alpha_b$ = azimuth angle of the horizontal propagation direction of the background wind, and
      $\phi_0$ = wave phase at the density maximum or minimum.

The relationship between the horizontal components of the background wind and the gravity
wave is illustrated in Figure 2.1. As can be seen in Equations (2.3) and (2.4), the altitude
variations of a Na density maximum or minimum are influenced by the background wind. Equation
(2.3), along with the gravity wave polarization and dispersion relations and other information
that can be calculated from the sodium density profiles, can be used to determine the intrinsic
parameters of the wave. For example, coefficients $a_2$ and $a_3$, defined in Equations (2.5) and (2.6),
are related to the intrinsic horizontal and vertical wavelengths and the horizontal propagation
direction of the wave.
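The forward model behind Equation (2.3) can be sketched in a few lines of Python. This is only an illustration, assuming the coefficients take the form obtained by solving a constant-phase condition of Equation (2.2) for z, with azimuth measured from north toward east. The function name, argument names, and numerical values are hypothetical and are not part of the original analysis.

```python
import math


def plane_coefficients(lambda_z, lambda_h, period, alpha_w,
                       v_bh=0.0, alpha_b=0.0, v_bz=0.0, phi_0=0.0):
    """Coefficients (a1, a2, a3, a4) of z = a1*t + a2*x + a3*y + a4.

    Lengths in metres, period in seconds, winds in m/s, angles in radians
    (azimuth from north toward east, so x points east and y points north).
    """
    ratio = lambda_z / lambda_h
    # Vertical drift of the constant-phase surface, including the wind terms.
    a1 = -lambda_z / period + ratio * v_bh * math.cos(alpha_b - alpha_w) + v_bz
    a2 = -ratio * math.sin(alpha_w)   # east-west slope
    a3 = -ratio * math.cos(alpha_w)   # north-south slope
    a4 = phi_0 * lambda_z / (2.0 * math.pi)
    return a1, a2, a3, a4


# Illustrative values only: a southward wave with no background wind.
a1, a2, a3, a4 = plane_coefficients(
    lambda_z=4.1e3, lambda_h=85e3, period=102 * 60.0,
    alpha_w=math.radians(180.0))
print(a1, a2, a3, a4)
```

For a wave travelling due south the east-west slope vanishes, so altitude measurements along a purely east-west track carry little information about the wavelength ratio.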

In order to determine the four coefficients $a_1$ through $a_4$ uniquely, at least four
measurements of a density maximum or minimum are needed. The altitude of a density
maximum or minimum must be measured at an appropriate combination of four different times
and horizontal positions. For example, three measurements at three different positions
(not located in a line) and one additional measurement at one of the three positions at a different
time would be sufficient. The three horizontal positions must form a triangle
to determine the wave propagation direction uniquely.

Since the wave phase is not of interest in this study and coefficient $a_4$ is constant, this
coefficient can be eliminated from the analysis by considering the differences between the altitude
measurements at three different positions. By using this approach, a minimum of three
measurements at three positions is sufficient to determine the coefficients $a_1$ through $a_3$ in
Equation (2.3). Usually many more than three measurements will be available, so that a
regression analysis can be used to estimate all four coefficients, thereby reducing the error.

Figure 2.1. The relationship between the horizontal components of the wave propagation
direction and background atmospheric wind.

In order to minimize the effects of roundoff errors in computing the regression
coefficients, it is the usual practice to subtract a reference data point from each measurement
before the regression coefficients are calculated. Typically, the reference data point is the average
of all the measurements. To simplify the mathematics, subtraction of the reference data point is
not included in the following analysis. Assume that a total of n measurements of a density
maximum or minimum are obtained at several locations and times. The resulting system of
equations can be written in matrix form using Equation (2.3).


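$$\mathbf{Z} = \mathbf{U}\mathbf{A} \tag{2.8}$$

where

$$\mathbf{Z} = [z_1 \;\; z_2 \;\; \cdots \;\; z_n]^T \tag{2.9}$$

$$\mathbf{U} = \begin{bmatrix} t_1 & x_1 & y_1 & 1 \\ t_2 & x_2 & y_2 & 1 \\ \vdots & \vdots & \vdots & \vdots \\ t_n & x_n & y_n & 1 \end{bmatrix} \tag{2.10}$$

$$\mathbf{A} = [a_1 \;\; a_2 \;\; a_3 \;\; a_4]^T \tag{2.11}$$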

where $z_i$, $t_i$, $x_i$, and $y_i$ correspond to the altitude, time, x-coordinate, and y-coordinate of the
$i$th measurement. By assuming that the errors in the n measurements are mutually statistically
independent, the coefficients $a_1$ through $a_4$ are computed by using the least-squares solution,

$$\mathbf{A} = [\mathbf{U}^T\mathbf{U}]^{-1}\mathbf{U}^T\mathbf{Z} \tag{2.12}$$


The model given by Equation (2.3) assumes that there is a three-dimensional plane on which the
wave phase is constant. The least-squares solution is a multiple linear regression fit of the
measurements to this three-dimensional plane.
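The regression fit described above can be sketched in plain Python. This is a minimal illustration, not the thesis's own code: it subtracts the mean of the measurements as the reference data point and then solves the 3 x 3 normal equations for the slopes. The helper names `solve_linear` and `fit_phase_plane` and the synthetic measurements are assumptions made for the example.

```python
def solve_linear(matrix, rhs):
    """Solve a small dense system by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(row) + [value] for row, value in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    solution = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(aug[r][c] * solution[c] for c in range(r + 1, n))
        solution[r] = (aug[r][n] - tail) / aug[r][r]
    return solution


def fit_phase_plane(t, x, y, z):
    """Least-squares fit of z = a1*t + a2*x + a3*y + a4; returns (a1, a2, a3, a4)."""
    n = len(z)
    mean = [sum(v) / n for v in (t, x, y, z)]
    # Subtract the mean (the reference data point) to limit roundoff error.
    cols = [[value - m for value in v] for v, m in zip((t, x, y), mean)]
    zc = [value - mean[3] for value in z]
    normal = [[sum(p * q for p, q in zip(cols[i], cols[j])) for j in range(3)]
              for i in range(3)]
    rhs = [sum(p * q for p, q in zip(cols[i], zc)) for i in range(3)]
    a1, a2, a3 = solve_linear(normal, rhs)
    # The intercept follows from the means once the slopes are known.
    a4 = mean[3] - a1 * mean[0] - a2 * mean[1] - a3 * mean[2]
    return a1, a2, a3, a4


# Synthetic, noise-free measurements (s, m, m) generated from known coefficients.
true_a = (-0.67, 0.0, 0.048, 87000.0)
t = [0.0, 100.0, 200.0, 300.0, 400.0, 500.0]
x = [0.0, 1.0e4, 2.0e4, 1.0e4, 0.0, -1.0e4]
y = [0.0, 0.0, 1.0e4, 2.0e4, 1.0e4, 2.0e4]
z = [true_a[0] * ti + true_a[1] * xi + true_a[2] * yi + true_a[3]
     for ti, xi, yi in zip(t, x, y)]
fitted = fit_phase_plane(t, x, y, z)
print(fitted)
```

Centering the data before forming the normal equations gives the same slopes as the full four-column least-squares solution while keeping the matrix entries at comparable magnitudes.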

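The wave propagation direction and the ratio of the horizontal and vertical wavelengths can
be calculated from coefficients $a_2$ and $a_3$.

$$\alpha_w = \tan^{-1}\left(\frac{a_2}{a_3}\right) \tag{2.13}$$

$$\frac{\lambda_h}{\lambda_z} = \frac{1}{\sqrt{a_2^2 + a_3^2}} \tag{2.14}$$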


The intrinsic vertical wavelength and wave amplitude are usually determined from the vertical
wavenumber power spectra of the Na density profiles [Gardner and Voelz, 1987]. The vertical
wavelength can also be determined by measuring the spacings between maxima and minima in
the Na density profiles. The horizontal wavelength can then be calculated by multiplying
Equation (2.14) by the vertical wavelength. The intrinsic wave period is determined by using the
gravity wave dispersion relation,

$$T = \frac{\lambda_h}{\lambda_z} T_B \tag{2.15}$$

where $T_B$ = Brunt-Väisälä period (≈ 5 min).
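As a hedged sketch, the conversion from the fitted slopes to wave parameters might look as follows in Python. It assumes the slopes are $a_2 = -(\lambda_z/\lambda_h)\sin\alpha_w$ and $a_3 = -(\lambda_z/\lambda_h)\cos\alpha_w$ and uses the approximation $\lambda_h/\lambda_z \approx T/T_B$ quoted later in this section. The two-argument arctangent is a choice made here to resolve the quadrant and is not taken from the original text; all names and input values are illustrative.

```python
import math


def wave_parameters(a2, a3, lambda_z, t_b=300.0):
    """Return (alpha_w [rad], lambda_h/lambda_z, lambda_h [m], period [s]).

    a2, a3   : fitted east-west and north-south slopes of the phase plane
    lambda_z : vertical wavelength in metres (e.g. from the power spectrum)
    t_b      : Brunt-Vaisala period in seconds (about 5 min)
    """
    # Both slopes carry a minus sign, so negate them before taking the angle.
    alpha_w = math.atan2(-a2, -a3) % (2.0 * math.pi)
    ratio = 1.0 / math.hypot(a2, a3)
    lambda_h = ratio * lambda_z
    period = ratio * t_b
    return alpha_w, ratio, lambda_h, period


# Illustrative slopes for a wave at 178 deg azimuth with lambda_h = 80 km.
slope = 4.1e3 / 80e3
a2 = -slope * math.sin(math.radians(178.0))
a3 = -slope * math.cos(math.radians(178.0))
alpha_w, ratio, lambda_h, period = wave_parameters(a2, a3, lambda_z=4.1e3)
print(math.degrees(alpha_w), ratio, lambda_h / 1e3, period / 60.0)
```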

The measurement accuracies for these wave parameters depend on the accuracies of the
computed coefficients, $a_2$ and $a_3$, which in turn depend on the accuracies of the measured values
of the altitude, z. The covariance matrix for the coefficients $a_1$ through $a_4$ estimated from
mutually statistically independent observations is given by

$$\mathbf{C} = [\mathbf{U}^T\mathbf{U}]^{-1}\,\mathrm{Var}(z) \tag{2.16}$$

where

$$\mathbf{C} = \begin{bmatrix} \mathrm{Var}(a_1) & \mathrm{Cov}(a_1,a_2) & \mathrm{Cov}(a_1,a_3) & \mathrm{Cov}(a_1,a_4) \\ \mathrm{Cov}(a_1,a_2) & \mathrm{Var}(a_2) & \mathrm{Cov}(a_2,a_3) & \mathrm{Cov}(a_2,a_4) \\ \mathrm{Cov}(a_1,a_3) & \mathrm{Cov}(a_2,a_3) & \mathrm{Var}(a_3) & \mathrm{Cov}(a_3,a_4) \\ \mathrm{Cov}(a_1,a_4) & \mathrm{Cov}(a_2,a_4) & \mathrm{Cov}(a_3,a_4) & \mathrm{Var}(a_4) \end{bmatrix} \tag{2.17}$$

and Var(z) is the variance of the error in the altitude measurements. The variances of the
computed wave parameters are


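$$\mathrm{Var}(\alpha_w) = \frac{a_3^2\,\mathrm{Var}(a_2) - 2 a_2 a_3\,\mathrm{Cov}(a_2,a_3) + a_2^2\,\mathrm{Var}(a_3)}{(a_2^2 + a_3^2)^2} \tag{2.18}$$

$$\mathrm{Var}\left(\frac{\lambda_h}{\lambda_z}\right) = \frac{a_2^2\,\mathrm{Var}(a_2) + 2 a_2 a_3\,\mathrm{Cov}(a_2,a_3) + a_3^2\,\mathrm{Var}(a_3)}{(a_2^2 + a_3^2)^3} \tag{2.19}$$

$$\mathrm{Var}(\lambda_h) = \lambda_z^2\,\mathrm{Var}\left(\frac{\lambda_h}{\lambda_z}\right) + \left(\frac{\lambda_h}{\lambda_z}\right)^2 \mathrm{Var}(\lambda_z) \tag{2.20}$$

$$\mathrm{Var}(T) = T_B^2\,\mathrm{Var}\left(\frac{\lambda_h}{\lambda_z}\right) + \left(\frac{\lambda_h}{\lambda_z}\right)^2 \mathrm{Var}(T_B) \tag{2.21}$$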
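The first two variances follow from first-order error propagation applied to $\alpha_w = \tan^{-1}(a_2/a_3)$ and $\lambda_h/\lambda_z = (a_2^2 + a_3^2)^{-1/2}$. The short Python sketch below evaluates them for given coefficient variances. The function name and the input numbers are illustrative assumptions; in practice the variances would come from the covariance matrix of the fit.

```python
def propagate_variances(a2, a3, var_a2, var_a3, cov_a23):
    """First-order variances of alpha_w = atan(a2/a3) and lambda_h/lambda_z."""
    s = a2 * a2 + a3 * a3
    # Partial derivatives of atan(a2/a3) are a3/s and -a2/s.
    var_alpha = (a3 * a3 * var_a2 - 2.0 * a2 * a3 * cov_a23
                 + a2 * a2 * var_a3) / s ** 2
    # Partial derivatives of s**-0.5 are -a2/s**1.5 and -a3/s**1.5.
    var_ratio = (a2 * a2 * var_a2 + 2.0 * a2 * a3 * cov_a23
                 + a3 * a3 * var_a3) / s ** 3
    return var_alpha, var_ratio


# Hypothetical case: a southward wave (a2 = 0) with equal, uncorrelated errors.
var_alpha, var_ratio = propagate_variances(
    a2=0.0, a3=0.05, var_a2=1.0e-6, var_a3=1.0e-6, cov_a23=0.0)
print(var_alpha ** 0.5, var_ratio ** 0.5)
```

In this example the direction error is controlled entirely by the uncertainty in $a_2$, and the wavelength-ratio error entirely by the uncertainty in $a_3$.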


Unfortunately, the variance expressions given in Equations (2.18) through (2.21) do not
provide any physical insight into the expected errors in the measurements. Therefore, two
examples, an airborne lidar experiment and a multiple ground-based lidar experiment, are
presented to illustrate the major factors influencing the measurement errors.

Figure 2.2. Ground track of the circular flight path with radius R.

Consider an aircraft flying over a circular path of radius R during a total observation
period of $T_{ob}$. The ground track of the flight is illustrated in Figure 2.2. Assume that a total of n
measurements equally spaced along the flight path are obtained. The covariance matrix for this
experimental configuration is calculated in Appendix 1 along with the rms errors in the gravity
wave parameters. The results are

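$$\mathrm{Std}(\alpha_w) \approx \frac{\lambda_h}{\lambda_z}\,\frac{\mathrm{Std}(z)}{R}\sqrt{\frac{2}{n}}\,\sqrt{\frac{\pi^2}{\pi^2 - 6}\sin^2\alpha_w + \cos^2\alpha_w} \tag{2.22}$$

$$\frac{\mathrm{Std}(\lambda_h/\lambda_z)}{\lambda_h/\lambda_z} \approx \frac{\lambda_h}{\lambda_z}\,\frac{\mathrm{Std}(z)}{R}\sqrt{\frac{2}{n}}\,\sqrt{\sin^2\alpha_w + \frac{\pi^2}{\pi^2 - 6}\cos^2\alpha_w} \tag{2.23}$$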


The rms errors in $\alpha_w$ and $\lambda_h/\lambda_z$ are proportional to the rms error in the measured altitude
of the density maximum or minimum, inversely proportional to the radius of the flight path, and
inversely proportional to the square root of the number of measurements. The rms error in the
altitude measurements is influenced by three types of noise: signal shot noise, background
noise, and "geophysical noise." The shot noise is characteristic of the photon counting process
used to detect the backscattered lidar signal. The background noise is introduced by the
background sky light from the stars, moon, and sun. Shot noise and background noise in different
profiles are statistically independent. To minimize the effects of shot noise and background
noise, Na density profiles are usually low-pass filtered vertically. The "geophysical noise" is
usually introduced by two types of processes: background wind variations and multiple waves
simultaneously influencing the Na layer. When the background wind is not constant over either
the horizontal or vertical extent of the measurements, the wind can influence the variations of
the altitude of a density maximum or minimum, as seen in Equations (2.3) and (2.4), and this
effect introduces error. Multiple waves can also influence the altitude of the density perturbations
and contribute to the error. For example, during the roundtrip flight from Denver to the Pacific
Coast in November 1986, a dominant wave induced both a density maximum and minimum in
the Na layer. By using the least-squares solution in Equation (2.12), the coefficients $a_1$ through
$a_4$ were calculated for both the density maximum and minimum. The altitudes of the density
maximum and minimum were then recalculated by substituting the estimated coefficients along
with the measured times and horizontal positions into Equation (2.3). The rms deviation of the
difference between the measured altitudes and the least-squares model was approximately 290 m
for the density maximum and 230 m for the density minimum. Geophysical noise was the
major cause of these variations. Thus typical values of Std(z) will vary between 200 and 500 m.
Depending on the profile integration time, geophysical noise in different profiles may be highly
correlated. If so, the number of measurements, n, in Equations (2.22) and (2.23) must be set
equal to the total number of statistically independent measurements.

The flight radius, R, also influences the measurement errors. The error in estimating the
three-dimensional plane of constant wave phase will be smaller when the observations are
made over a larger horizontal extent. Notice that the errors in $\alpha_w$ and $\lambda_h/\lambda_z$ are also proportional
to $\lambda_h/\lambda_z \approx T/T_B$. To maintain a given level of accuracy, the flight path radius should be large
when observing waves with a long period or large horizontal wavelength. However, the radius
cannot be chosen arbitrarily large, because the wave front must be coherent at each measurement
point and the measurements must all be made on the same wave front. A radius comparable to
the horizontal wavelength is probably a good compromise.

Consider now a configuration of three ground-based lidars at the corners of an equilateral
triangle with sides of length R, as illustrated in Figure 2.3. Assume that a total of n independent
measurements are obtained simultaneously at each of the three lidar sites during a total
observation period of $T_{ob}$. The covariance matrix and the measurement errors are calculated for
this configuration in Appendix 2. The results are

$$\mathrm{Std}(\alpha_w) = \frac{\lambda_h}{\lambda_z}\,\frac{\mathrm{Std}(z)}{R}\sqrt{\frac{2}{n}} \tag{2.24}$$

$$\frac{\mathrm{Std}(\lambda_h/\lambda_z)}{\lambda_h/\lambda_z} = \frac{\lambda_h}{\lambda_z}\,\frac{\mathrm{Std}(z)}{R}\sqrt{\frac{2}{n}} \tag{2.25}$$

Figure 2.3. Configuration for three ground-based lidars located at the corners of an equilateral
triangle with sides of length R.

Notice that the equations for the measurement errors for the ground-based experiment are quite
similar to those for the circular flight experiment.

The measurement error in $\lambda_z$ (see [Gardner and Voelz, 1987]) and the error in the
assumed value of $T_B$ are usually negligible. A measurement accuracy of 0.1 rad (≈ 6°) or better
for the propagation direction and a measurement accuracy of 10% or better for the intrinsic
period and horizontal wavelength would require

$$\frac{\lambda_h}{\lambda_z}\,\frac{\mathrm{Std}(z)}{R}\sqrt{\frac{2}{n}} \le 0.1 \tag{2.26}$$

Waves usually remain coherent in Na density profiles for 1 to 2 hours. Modern lidars can obtain
accurate measurements of the Na density profile in 1 to 2 min. Thus, n is typically 50-100 under
normal conditions, assuming each measurement is statistically independent of every other
measurement. As mentioned previously, Std(z) is dominated by geophysical noise with values
typically in the range of 200 to 500 m. By assuming that n = 50, Std(z) = 500 m, and $R \approx \lambda_h$,
Equation (2.26) is satisfied for all waves with $\lambda_z \ge 1$ km.
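The arithmetic behind this threshold can be checked with a short Python sketch. It assumes the error measure has the form $(\lambda_h/\lambda_z)(\mathrm{Std}(z)/R)\sqrt{2/n}$, which is the form that reproduces the $\lambda_z \ge 1$ km limit quoted above. The function name and the 100 km horizontal wavelength are illustrative choices only.

```python
import math


def relative_error(lambda_h, lambda_z, std_z, radius, n):
    """Assumed error measure (lambda_h/lambda_z) * (Std(z)/R) * sqrt(2/n)."""
    return (lambda_h / lambda_z) * (std_z / radius) * math.sqrt(2.0 / n)


lambda_h = 100e3  # hypothetical horizontal wavelength; it cancels when R = lambda_h

# Limiting case from the text: n = 50, Std(z) = 500 m, R = lambda_h, lambda_z = 1 km.
err_limit = relative_error(lambda_h, 1.0e3, 500.0, radius=lambda_h, n=50)

# A 4.1 km vertical wavelength under the same conditions.
err_4km = relative_error(lambda_h, 4.1e3, 500.0, radius=lambda_h, n=50)

print(err_limit, err_4km)
```

With R set equal to the horizontal wavelength the result depends only on Std(z), n, and the vertical wavelength, so longer vertical wavelengths comfortably meet the 0.1 requirement.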

In addition to the airborne and multiple ground-based measurements, steerable lidar
measurements can also be used to determine the gravity wave parameters. Because of the
increased atmospheric attenuation and longer propagation path lengths at lower elevation angles,
the maximum zenith angle for the steerable measurements is usually limited to about 45°. At the
altitudes of the Na layer near 90-100 km, a laser beam directed at a zenith angle of 45° would
intersect the Na layer at a horizontal distance of about 100 km from the point directly above the
lidar site. Thus, the longest practical horizontal baseline for the steerable measurements would
be approximately 200 km. Based on the calculations presented above, it appears that a steerable
lidar can be used to measure accurately the intrinsic parameters of gravity waves with horizontal
wavelengths of about 400 km or less.
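The baseline estimate is simple geometry, sketched below in Python under a flat-Earth assumption (Earth curvature and refraction are ignored, which is an approximation introduced here). The function name and the 95 km layer altitude are illustrative.

```python
import math


def horizontal_offset(layer_altitude, zenith_angle_deg):
    """Horizontal distance from the zenith point to where the beam meets the layer."""
    return layer_altitude * math.tan(math.radians(zenith_angle_deg))


# Beam steered 45 deg off zenith toward a layer assumed to sit at 95 km.
offset = horizontal_offset(95e3, 45.0)
# Steering to opposite sides of the site doubles the usable baseline.
baseline = 2.0 * offset
print(offset / 1e3, baseline / 1e3)
```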

2.3  Experimental  Data 

In early November of 1986, the UIUC Na lidar system was installed on the Electra
aircraft operated by NCAR-RAF. Major lidar and aircraft parameters are summarized in Table
2.1. Following the installation, a total of three flights were conducted out of Stapleton
International Airport in Denver, Colorado from November 13 to 18, 1986. The ground tracks of
these flights are illustrated in Figure 2.4. The flight and system performance characteristics are
summarized in Table 2.2. During the three flights, a total of 425 Na density profiles were
collected. The integration period for each profile was 100 s. The horizontal distance
corresponding to this integration period varied from 11 to 20 km, depending on the ground speed
of the aircraft.

In this section, the characteristics of a quasi-monochromatic wave observed during the
westward flight conducted on the night of November 17-18 are analyzed. The flight
included one eastbound leg and one westbound leg. In Figure 2.5, two sequences of Na density
profiles collected during the two flight legs are plotted versus longitude in the range from 117°W
to 126°W. The profiles have been filtered vertically with a cutoff of 3 km and horizontally with a
cutoff of 50 km. The profiles have also been normalized so that each has the same column
abundance, and are plotted on a linear scale. At the flight altitude of approximately 8 km, the
prevailing winds were eastward. Consequently, the ground speed of the aircraft was much
slower during the westbound leg, and the horizontal distance corresponding to the profile
integration period was much shorter. On the westbound leg in the longitude range from 113°W
to 117°W, overhead clouds obscured the lidar, and no data were collected. Note the dominant
density perturbations in the altitude range from 85 to 90 km in the density profiles measured on
both the westbound and eastbound legs. These perturbations appear to be influenced by a
quasi-monochromatic wave.


Table 2.1. The Parameters of the UIUC Na Lidar System
and NCAR Electra Aircraft

Transmitter
  Laser                               Flashlamp-pumped dye laser (Candela LFDL-1)
  Wavelength                          589 nm
  Energy                              50 mJ/pulse
  Pulse Width                         2 µs FWHM
  Repetition Rate                     7.5 Hz
  Beam Divergence                     3 mrad FW @ e^-2

Receiver
  Telescope                           35 cm diameter Cassegrain (Celestron 14)
  Aperture Area                       0.1 m^2
  Field-of-view                       3 mrad
  Optical Bandwidth                   0.5 nm FWHM
  Vertical Range Resolution           150 m

Aircraft
  Model                               Electra 4 engine turboprop
  Nominal Cruising Altitude           6 - 8 km
  Ground Speed During Observations    110 - 200 m s^-1

Figure  2.4.  Ground  tracks  of  the  three  flights  of  the  airborne  Na  lidar  experiment  in  November, 
1986.  The  flights  were  conducted  out  of  Denver,  Colorado. 


Table 2.2. Summary of the Flights and System Performance

Flight                          1                 2              3
Date                            Nov. 13           Nov. 15-16     Nov. 17-18
Observation Time (MST)          2119-2357         2210-0309      2142-0502
Observation Duration (hours)    2.6               5.0            7.3
Flight Pattern                  Triangular        Eastward       Westward
Location                        CO, NM, AZ, WY    CO-IL          CO-Pacific Coast
Na Signal Level^a               10                7              8
(total Na counts/shot)

Total Observation Time = 14.9 hours

^a Typical ground-based signal level at Urbana is 5 counts/shot.


Figure 2.5. Sodium density profiles collected during the westbound and eastbound legs of the
westward flight on November 17-18, 1986. The profiles have been filtered
vertically with a cutoff of 3 km and horizontally with a cutoff of 50 km. The
profiles have also been normalized so that each has the same column abundance,
and are plotted on a linear scale.


The altitude variations of a local Na density maximum and minimum induced by the
dominant wave on the bottomside of the layer are plotted versus time in Figure 2.6. The altitudes
were computed by first filtering the Na density profiles vertically with a cutoff of 2 km. Between
0115 and 0130 MST, the aircraft changed flight direction from approximately 277° azimuth
(westward) to 80° azimuth (eastward), and no profiles were collected during this period. Circles
represent the altitudes of the density maximum, and crosses represent the altitudes of the density
minimum. A total of 46 measurements of the density maximum were obtained. The averages of
the measured times, horizontal positions, and altitudes were computed and used as the reference
data point in the regression model. The coefficients $a_1$ through $a_4$ were calculated by using the
least-squares solution in Equation (2.12), and the altitudes of the density maximum were then
recalculated by substituting the estimated coefficients along with the measured times and
positions into Equation (2.3). The results are plotted as the solid line near the altitudes of the
density maximum in Figure 2.6. The standard deviation of the measured altitudes from the
least-squares fitted altitudes was 290 m. The data for the density minimum were processed in a
similar manner. A total of 50 measurements were obtained, and the solid line near the altitudes of
the density minimum is the least-squares fit for these measurements. The standard deviation of the
measured altitudes from the least-squares fitted altitudes was 230 m.

The vertical wavelength of the wave can be estimated using several approaches. The
average vertical separation between the least-squares fits of the density maxima and minima
plotted in Figure 2.6 is approximately equal to $\lambda_z/2$. However, the steep density gradients on the
bottomside of the Na layer will result in separation distances that are slightly shorter than $\lambda_z/2$.
For the data plotted in Figure 2.6, the least-squares fits are separated by an average distance of
1.9 km, so that $\lambda_z$ is slightly larger than 3.8 km.

The vertical wavelength of the dominant wave can also be estimated from the vertical
wavenumber power spectrum of the Na profiles. The technique is described in detail by Gardner
and Voelz [1987]. The power spectrum of the Na profiles is plotted in Figure 2.7. The vertical
wavelength of the dominant wave, which was responsible for the density perturbations in the
altitude region from 85 to 90 km (Figure 2.5), is believed to be 4.1 km. The amplitude of this
wave is also calculated from the spectrum to be 16 m s$^{-1}$ [Gardner and Voelz, 1987].

Figure 2.6. The altitude variations of a local Na density maximum and minimum measured
during the westward flight on November 17-18, 1986. Circles represent the
altitudes of the density maximum, and crosses represent the altitudes of the density
minimum. The solid lines represent the least-squares fitted altitudes of the density
maximum and minimum.

Figure 2.7. Average vertical wavenumber power spectrum of the Na density profiles collected
from 0131 to 0138 MST during the westward flight on November 17-18, 1986.

The coefficients estimated from the least-squares solutions were used in Equations (2.13)
through (2.15) to calculate the gravity wave propagation direction, ratio of horizontal to vertical
wavelength, and intrinsic period. The vertical wavelength, wave amplitude, and amplitude
growth length were calculated from the Na profile power spectrum. The results are summarized
in Table 2.3. The wave parameters calculated from the variations of the density maximum are
quite comparable to those calculated from the variations of the density minimum, which indicates
that the dominant wave was influencing both the density maximum and minimum. The wave
propagation angle was almost due south. The large values for the standard deviations of the
estimated horizontal wavelengths and periods were the results of large variances computed for
the ratio of the horizontal to vertical wavelengths, $\lambda_h/\lambda_z$, in Equation (2.19). The large variances
appear to be the result of the directions of the two flight legs, which were both approximately
orthogonal (westbound and eastbound) to the wave propagation direction (southbound). By
assuming $V_{bz} \approx 0$, it is also possible to estimate the background horizontal wind velocity in
the direction of wave propagation by using Equation (2.4). The calculated background
horizontal wind velocity was about 5 m s$^{-1}$ northward, which is comparable to the meridional
background wind measured at Platteville, Colorado (40°N, 105°W) near 90 km altitude during
the period from November 4-20, 1986 [Kwon et al., 1989b]. The horizontal distance between
the location of the airborne lidar observations and the location of the source of a wave can be
estimated roughly by multiplying the height of the Na layer by the ratio of the intrinsic horizontal
wavelength to the vertical wavelength of the observed wave. For the dominant wave observed
during the flight, the horizontal distance between the measurement locations and the tropospheric
source was computed to be approximately 1800 km. Interestingly, during the period from
November 15 to 18, a meteorological low pressure system developed over the Gulf of Alaska,
which is about 2800 km north of the flight path. Although it is speculative, this low pressure
system might be responsible for the observed wave.
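The rough source-distance estimate described above is a single multiplication, sketched here in Python. The 90 km layer height is an assumed round number (the text places the layer near 90-100 km), and the function name is hypothetical; the two wavelength pairs are the density-maximum and density-minimum estimates reported for this flight.

```python
def source_distance(layer_height, lambda_h, lambda_z):
    """Horizontal distance to the source: layer height times lambda_h / lambda_z."""
    return layer_height * lambda_h / lambda_z


layer_height = 90e3  # assumed Na layer height in metres

# Wavelength estimates from the density maximum and the density minimum.
d_max = source_distance(layer_height, lambda_h=80e3, lambda_z=4.1e3)
d_min = source_distance(layer_height, lambda_h=89e3, lambda_z=4.1e3)
print(d_max / 1e3, d_min / 1e3)
```

Both values fall within roughly 10% of the 1800 km figure quoted in the text, which is as much agreement as an estimate of this kind can be expected to give.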


Table 2.3. Intrinsic Parameters of the Dominant Wave Observed
During the Westward Flight on November 17-18, 1986

Altitude Variations                Density Maximum    Density Minimum
Data Points                        46                 50
Vertical Wavelength (km)           4.1 (±0.03)        4.1 (±0.03)
Horizontal Wavelength (km)         80 (±28)           89 (±27)
Period (min)                       97 (±34^a)         107 (±32^a)
Wave Propagation Direction (°)     178.1 (±0.4)       178.6 (±0.3)
Wave Amplitude (m s^-1)            17 (±1)            17 (±1)
Amplitude Growth Length (km)       16 (±2)            16 (±2)

^a Calculated by assuming Var(T_B) ≈ 0 in Equation (2.21).


Other waves that appear to be propagating westward were also influencing the Na layer
during the flight. The horizontal wavenumber spectra computed for the bottomside and
topside of the Na layer measured during the westbound and eastbound legs of the westward
flight are plotted in Figures 2.8 a and b, respectively. The technique for estimating these
horizontal wavenumber spectra is described in detail by Kwon et al. [1989a]. The horizontal
wavenumber is the inverse of zonal distance along the flight legs. Note the spectral peaks in the
bottomside spectrum for the westbound observations (Figure 2.8 a) near the wavelengths of 457
km and 316 km, and the peaks in the bottomside spectrum for the eastbound observations
(Figure 2.8 b) near 319 km and 207 km. The peak near 457 km in Figure 2.8 a appears to
correspond to the peak near 319 km in Figure 2.8 b, and the peak near 316 km in Figure 2.8 a
appears to correspond to the peak near 207 km in Figure 2.8 b. These peaks may be related to
westward propagating waves. In the bottomside spectrum measured on the westbound
observations (Figure 2.8 a), these peaks seem to be Doppler shifted to lower wavenumbers. The
observed Doppler shifted wavelength and the intrinsic zonal wavelength are related by the
following equation [Kwon et al., 1989a].

$$\lambda_{in} = \lambda_{ob}\left(1 - \frac{V_{ob}}{V_a}\right) \tag{2.27}$$

where $\lambda_{in}$ = intrinsic wavelength along the flight path,
      $\lambda_{ob}$ = observed wavelength along the flight path,
      $V_{ob} = V_p + V_b$ = observed phase velocity along the flight path,
      $V_p$ = intrinsic phase velocity along the flight path,
      $V_b$ = background atmospheric wind velocity along the flight path at the altitudes
      of the Na layer, and
      $V_a$ = aircraft velocity.

Figure 2.8 a). Horizontal wavenumber spectra computed for the bottomside and topside of the
Na layer measured during the westbound leg of the westward flight on November
17-18, 1986. The data were filtered vertically with a cutoff of 1 km before the
spectra were computed.

Figure 2.8 b). Horizontal wavenumber spectra computed for the bottomside and topside of the
Na layer measured during the eastbound leg of the westward flight on November
17-18, 1986. The data were filtered vertically with a cutoff of 1 km before the
spectra were computed.

Because observations were made over both the westbound and eastbound flight legs, and
the aircraft velocity is known, Equation (2.27) can be solved for both the intrinsic wavelength
and observed phase velocity along the flight path. By using the two observed wavelengths of
316 km and 207 km, the intrinsic zonal wavelength of this wave is computed to be 243 km, and
the observed phase velocity is approximately 31 m s$^{-1}$ westward. The average zonal wind
velocity measured at Platteville, Colorado was about 5 m s$^{-1}$ eastward over the altitude range of
the Na layer [Kwon et al., 1989b]. By subtracting this background wind velocity from the
observed phase velocity, the intrinsic zonal phase velocity is calculated to be 36 m s$^{-1}$ westward.
The intrinsic wave period is then calculated to be 111 min, or approximately 2 hours. Therefore,
there appeared to be at least two waves, with periods of 102 min and 111 min, influencing the
Na layer during the flight; one wave was propagating westward, and the other wave was
propagating southward. For the wave corresponding to the spectral peak at 457 km in Figure
2.8 a and the peak at 319 km in Figure 2.8 b, the intrinsic zonal wavelength is approximately
366 km, the intrinsic zonal phase velocity is 32 m s$^{-1}$ westward, and the intrinsic period is 192
min.
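The two-leg solution can be illustrated with a short Python sketch. It assumes the Doppler relation takes the form $\lambda_{in} = \lambda_{ob}(1 - V_{ob}/V_a)$ on the leg flown in the direction of the wave and $\lambda_{in} = \lambda_{ob}(1 + V_{ob}/V_a)$ on the leg flown against it, with all speeds taken as positive magnitudes. The aircraft ground speeds below are hypothetical values chosen inside the 110-200 m s$^{-1}$ range of Table 2.1; the actual leg speeds are not given in this section.

```python
def solve_two_leg_doppler(lambda_with, v_with, lambda_against, v_against):
    """Return (intrinsic wavelength, observed phase speed) from two flight legs.

    lambda_with, v_with       : observed wavelength and aircraft speed on the leg
                                flown in the wave's direction of travel
    lambda_against, v_against : the same for the leg flown against the wave
    Wavelengths share one unit and speeds share another.
    """
    # Equate lambda_with*(1 - V/v_with) and lambda_against*(1 + V/v_against).
    v_obs = (lambda_with - lambda_against) / (
        lambda_with / v_with + lambda_against / v_against)
    lambda_in = lambda_with * (1.0 - v_obs / v_with)
    return lambda_in, v_obs


# Observed wavelengths (km) from the text; aircraft speeds (m/s) are hypothetical.
lambda_in, v_obs = solve_two_leg_doppler(316.0, 134.0, 207.0, 178.0)

# Remove a 5 m/s eastward wind from the westward observed phase speed.
v_intrinsic = v_obs + 5.0
period_min = lambda_in * 1e3 / v_intrinsic / 60.0
print(lambda_in, v_obs, period_min)
```

With these assumed speeds the sketch gives an intrinsic wavelength close to 243 km and an observed phase speed close to 31 m s$^{-1}$, in line with the values quoted above.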

Dominant waves with a period of approximately 2 hours have been observed often with
the UIUC Na lidars at Urbana, Illinois (40°N) [Gardner et al., 1986; Gardner and Voelz, 1987;
Gardner, 1989], at Broomfield, Colorado (40°N) [Kwon et al., 1989b], and during the eastward
flight on November 15-16, 1986, which was conducted over the Great Plains in the latitude range
from 39°N to 40°N (Figure 2.4) [Kwon et al., 1989a]. These waves were dominant only on the
bottomside of the Na layer (80-90 km). The zonal component of the 2-hour period wave
measured during the eastward flight was observed to propagate westward at an apparent velocity
of 38 m s$^{-1}$, and the intrinsic zonal wavelength of this wave was 263 km.

In order to estimate spectral slopes for the horizontal wavenumber spectra plotted in
Figures 2.8 a and b, linear regression fits were performed over horizontal scales from 70 to 700
km. The spectral slopes for the bottomside and topside spectra for the westbound observations
(Figure 2.8 a) were -1.69 and -1.05, respectively. The slopes for the bottomside and topside
spectra for the eastbound observations (Figure 2.8 b) were -1.41 and -0.71, respectively. The
spectral slopes for the topside spectra were consistently shallower than those of the bottomside
spectra. The shallower slopes of the topside spectra may be the result of saturation affecting the
larger scale waves. The topside spectral amplitudes were larger, which indicates that the wave
amplitudes were growing with altitude.

2.4  Summary 

A  data  analysis  technique  for  determining  intrinsic  gravity  wave  parameters  including 
wave  propagation  direction  has  been  presented.  This  technique  involves  measuring  the  altitude 
variations  of  density  perturbations  induced  in  the  mesospheric  Na  layer  by  gravity  waves.  This 
technique can be used with airborne lidars, multiple ground-based lidars, and steerable lidars.
Several  examples  have  been  presented  to  show  the  expected  measurement  accuracies  for  the 
wave  parameters.  Although  the  technique  was  demonstrated  using  Na  lidar  data,  it  can  also  be 
used  with  Rayleigh  lidars. 

The  technique  has  been  successfully  applied  to  the  airborne  Na  lidar  data  obtained  during 
the  roundtrip  flight  from  Denver,  Colorado  to  the  Pacific  Coast  in  November  1986.  The 
horizontal  wavelength  of  a  dominant  wave  observed  during  the  flight  was  approximately  85  km, 
and the vertical wavelength was 4.1 km. The intrinsic period was 102 min, and the propagation
direction  was  almost  due  south. 

The  wave  parameters  of  two  additional  gravity  waves  have  been  estimated  from  the 
horizontal wavenumber spectra measured during the flight. One of the waves had an
approximate period of 2 hours, and its zonal component was propagating westward at a velocity
of 31 m s^-1. The parameters of this wave were quite similar to those of the 2-hour period wave
observed  during  the  eastward  flight  on  November  15-16, 1986.  Based  upon  these 
measurements  and  ground-based  observations  made  at  Urbana,  Illinois,  it  appears  that  the  2-hour 
period  waves  often  dominate  the  bottomside  of  the  Na  layer  (80-90  km)  at  mid-latitudes  over 
North  America. 




3. AIRBORNE SODIUM LIDAR OBSERVATIONS OF HORIZONTAL AND
VERTICAL WAVENUMBER SPECTRA OF MESOPAUSE DENSITY AND
WIND PERTURBATIONS

3.1  Introduction 

Gravity  waves  play  an  important  role  in  the  dynamics  of  the  mesopause  region.  Because 
this  region  is  relatively  inaccessible  to  in-situ  measurements,  most  studies  have  relied  on  remote 
sensing  techniques.  In  recent  years,  Na  lidar  techniques  have  been  developed  to  provide 
important  data  on  gravity  waves  and  tides  near  the  mesopause.  The  Na  layer  is  generally 
confined to an altitude range from 80 to 110 km, and the peak density typically occurs at 90 km.
The  Na  layer  is  particularly  useful  for  studying  mesopause  dynamics,  because  the  layer  profile  is 
very  sensitive  to  gravity  wave  perturbations.  The  amplitudes  of  the  waves  are  usually  large  near 
the  mesopause,  and  the  steep  Na  density  gradients  on  the  bottom  and  topsides  of  the  layer  tend  to 
enhance  the  observed  wave  perturbations.  Gardner  and  Voelz  [1987]  reported  an  extensive 
analysis  of  the  monochromatic  gravity  waves  observed  in  the  Na  layer  above  Urbana,  Illinois. 
Studies of atmospheric tides using Na lidars have also been reported by Batista et al. [1985] and
Kwon et al. [1987].

Because most radar and lidar observations have been made at fixed elevation angles,
studies of gravity waves are usually limited to exploring only the vertical and temporal structures
of the waves. Radar-based studies of gravity waves have shown an approximate $k_z^{-3}$ dependence
in vertical wavenumber spectra of the horizontal wind perturbations [Smith et al., 1985; Tsuda et
al., 1988], and an approximate $f^{-5/3}$ dependence in the temporal frequency spectra [Balsley and
Carter, 1982; Vincent, 1984]. Fritts et al. [1989] have presented the only measurements of the
horizontal wavenumber spectra of the mesopause region. They calculated the spectra in the
altitude range from 60 to 90 km from atmospheric density variations measured during seven
re-entries of NASA space shuttles. The horizontal wavenumber spectra exhibited a slope of
approximately -2 at horizontal scales from 10 to 1000 km.




With the advent of airborne Na lidar techniques, the study of the horizontal structure of
gravity waves has become more feasible. In March 1983, the UIUC group conducted the first
airborne Na lidar observations on a single roundtrip flight from the NASA Wallops Flight
Facility to Albany, New York [Segal et al., 1984]. This experiment demonstrated the feasibility
of airborne measurements and revealed evidence of wave induced horizontal structures over the
650 km flight path.

In November of 1986, the UIUC group conducted another airborne Na lidar campaign
with the support of the NCAR-RAF in Broomfield, Colorado. In order to investigate the
longitudinal characteristics of gravity waves at mid-latitudes, a total of three flights were made
over the Great Plains, Rocky Mountains, and Pacific Coast. The flights were conducted out of
Stapleton International Airport in Denver, Colorado using the NCAR Electra aircraft. The kinetic
energy horizontal and vertical wavenumber spectra of horizontal winds inferred from the airborne
Na density profiles are presented in this chapter. The theoretical basis for estimating the
horizontal wind amplitudes and the spectra from the Na lidar data is described in Section 3.2.

The  experiment  is  described  in  Section  3.3,  and  the  experimental  data  are  presented  and 
discussed  in  Section  3.4.  The  airborne  data  are  compared  with  the  similar  observations  obtained 
with  other  techniques  in  Section  3.5. 

3.2  Layer  Density  Response 

The density response of an atmospheric layer composed of a minor neutral constituent to
a  wave  induced  wind  perturbation  is  governed  by  the  continuity  equation.  By  neglecting 
diffusion  and  chemical  effects  and  assuming  the  unperturbed  layer  is  horizontally  homogeneous, 
Gardner  and  Shelton  [1985]  have  shown  that  the  density  response  can  be  written  in  the  form 

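$$ n_s(\mathbf{r},t) = e^{-\phi}\, n_0(z - \theta_z) \tag{3.1} $$

where

$n_s(\mathbf{r},t)$ = Na density at position $\mathbf{r}$ and time $t$

$n_0(z)$ = steady-state Na density profile in the absence of wind perturbations

$\mathbf{r} = x\hat{\mathbf{x}} + z\hat{\mathbf{z}}$ = position vector, where $x$ is the horizontal coordinate and $z$ is the vertical coordinate

and $\phi$ and $\theta_z$ are solutions to the partial differential equations

$$ \frac{\partial \phi}{\partial t} = \nabla\cdot\mathbf{V} - \mathbf{V}\cdot\nabla\phi \tag{3.2} $$

$$ \frac{\partial \theta_z}{\partial t} = v_z - \mathbf{V}\cdot\nabla\theta_z \tag{3.3} $$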

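where $\mathbf{V} = v_x\hat{\mathbf{x}} + v_z\hat{\mathbf{z}}$ is the atmospheric velocity field. If the wind perturbations are small, the
second terms on the right-hand sides of (3.2) and (3.3) can be neglected [Gardner and Shelton,
1985] so that

$$ \phi(\mathbf{r},t) \approx \int_{-\infty}^{t} \nabla\cdot\mathbf{V}\, d\tau \tag{3.4} $$

$$ \theta_z(\mathbf{r},t) \approx \int_{-\infty}^{t} v_z\, d\tau \tag{3.5} $$

In this case the layer response is given by

$$ n_s(\mathbf{r},t) \approx \exp\left(-\int_{-\infty}^{t} \nabla\cdot\mathbf{V}\, d\tau\right) n_0\left(z - \int_{-\infty}^{t} v_z\, d\tau\right) \tag{3.6} $$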


The  layer  perturbations  result  from  a  multiplicative  distortion  due  to  the  wind  divergence  and  a 
vertical  displacement  of  the  layer  caused  by  the  vertical  winds.  The  relative  importance  of  these 
two  effects  depends  on  the  spatial  scale  of  the  wave  causing  the  wind  fluctuations. 

For  the  following  analysis  it  is  most  convenient  to  work  with  the  natural  logarithm  of  the 
ratio  of  the  perturbed  and  unperturbed  Na  density  profiles.  If  the  unperturbed  Na  density  profile 
is  modeled  as 

$$ n_0(z) = \exp[-g(z)] \tag{3.7} $$

then  Equation  (3.1)  can  be  used  to  show  that 


$$ r_s(\mathbf{r},t) = \ln(n_s/n_0) = -\phi - g(z - \theta_z) + g(z) \tag{3.8} $$

By expanding $g(z)$ in a Taylor series about the centroid height ($z_0$) of the unperturbed layer and
by retaining terms out to first order in $\theta_z$, $r_s(\mathbf{r},t)$ in Equation (3.8) can be written

$$ r_s(\mathbf{r},t) \approx -\phi + g'(z)\,\theta_z \tag{3.9} $$

By assuming that $g(z)$ is approximately quadratic, or equivalently that the unperturbed layer is
approximately Gaussian, Equation (3.9) can be simplified to

$$ r_s(\mathbf{r},t) \approx -\phi + \frac{(z - z_0)}{\sigma_0^2}\,\theta_z \tag{3.10} $$

where $\sigma_0$ is the rms thickness of the unperturbed sodium layer. By assuming that the
atmospheric density profile is approximately exponential

$$ n_a(z) = n_0\, e^{-(z - z_0)/H} \tag{3.11} $$

where $H$ is the atmospheric scale height and $n_0$ is the density at altitude $z_0$, the log ratio of the
perturbed and unperturbed atmospheric densities is [Gardner et al., 1989]

$$ r_a(\mathbf{r},t) = -\phi + \theta_z/H \tag{3.12} $$

For gravity wave perturbations, it is possible to simplify Equations (3.10) and (3.12) for
the relative density perturbations. When the vertical wavelength $\lambda_z$ and frequency $\omega$ satisfy the
conditions $\lambda_z \ll 4\pi H$ and $\omega \ll N$, where $N$ is the Brunt-Väisälä frequency, the gravity wave
polarization and dispersion relations can be used to show that

$$ \theta_z = \gamma H \phi \tag{3.13} $$

where $\gamma$ is the ratio of specific heats. In this case, the relative density perturbations can be
written as

$$ r_s(\mathbf{r},t) = -\left[1 - \gamma H (z - z_0)/\sigma_0^2\right] \phi(\mathbf{r},t) \tag{3.14} $$

$$ r_a(\mathbf{r},t) = -(1 - \gamma)\,\phi(\mathbf{r},t) \tag{3.15} $$
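
As a minimal sketch of how Equations (3.14) and (3.15) behave, the Python fragment below evaluates both log ratios for a given value of the wave variable. The numerical values chosen for the ratio of specific heats, the scale height, the centroid height, and the rms layer thickness are illustrative assumptions, not values taken from this report, and the function names are hypothetical.

```python
# Illustrative (assumed) parameter values; none are taken from the report.
GAMMA = 1.4        # ratio of specific heats
H_KM = 6.0         # atmospheric scale height (km)
Z0_KM = 92.0       # centroid height of the unperturbed Na layer (km)
SIGMA0_KM = 4.3    # rms thickness of the unperturbed Na layer (km)


def sodium_log_ratio(phi, z_km):
    """Equation (3.14): r_s = -[1 - gamma*H*(z - z0)/sigma0^2] * phi."""
    return -(1.0 - GAMMA * H_KM * (z_km - Z0_KM) / SIGMA0_KM**2) * phi


def atmospheric_log_ratio(phi):
    """Equation (3.15): r_a = -(1 - gamma) * phi."""
    return -(1.0 - GAMMA) * phi


def null_altitude_km():
    """Altitude at which the bracket in (3.14) vanishes, so r_s = 0 for any phi."""
    return Z0_KM + SIGMA0_KM**2 / (GAMMA * H_KM)
```

Under these assumed values the sodium response on the bottomside is several times larger in magnitude than the atmospheric response for the same wave, which is the enhancement by the steep layer gradients described in Section 3.1.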

The gravity-wave polarization and dispersion relations can be used to relate the vertical
wavenumber ($k_z$) spectrum, horizontal wavenumber ($k_x$) spectrum and mean-square value of $\phi$ to
the vertical and horizontal wavenumber gravity wave power spectra ($E_z(k_z)$ and $E_x(k_x)$) and
mean-square value of the horizontal winds ($\langle v_x^2 \rangle$) [Gardner et al., 1989].

$$ E_z(k_z) = (\gamma H N)^2 E_\phi(k_z) \tag{3.16} $$

$$ E_x(k_x) = (\gamma H N)^2 E_\phi(k_x) \tag{3.17} $$

$$ \langle v_x^2 \rangle = (\gamma H N)^2 \langle \phi^2 \rangle \tag{3.18} $$

By  combining  Equations  (3.14),  (3.15)  and  (3.18)  the  mean-square  relative  sodium  density 
perturbations  can  be  expressed  in  terms  of  the  mean-square  atmospheric  density  perturbations  or 
the  mean-square  horizontal  winds. 

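$$ \begin{aligned} \langle r_s^2(\mathbf{r},t) \rangle &= \left[1 - \gamma H (z - z_0)/\sigma_0^2\right]^2 \langle r_a^2(\mathbf{r},t) \rangle / (\gamma - 1)^2 \\ &= \left[1 - \gamma H (z - z_0)/\sigma_0^2\right]^2 \langle v_x^2(\mathbf{r},t) \rangle / (\gamma H N)^2 \end{aligned} \tag{3.19} $$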

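Because the ensemble average $\langle r_s^2 \rangle$ can be approximated by spatially averaging the relative Na
density perturbations throughout the vertical extent of the layer, it is easy to show that

$$ \langle v_x^2(x,t) \rangle = \frac{\alpha^2}{\Delta z} \int_{z_0 - \Delta z/2}^{z_0 + \Delta z/2} r_s^2(\mathbf{r},t)\, dz \tag{3.20} $$

and

$$ \langle r_a^2(x,t) \rangle = \left[\frac{\gamma - 1}{\gamma H N}\right]^2 \frac{\alpha^2}{\Delta z} \int_{z_0 - \Delta z/2}^{z_0 + \Delta z/2} r_s^2(\mathbf{r},t)\, dz \tag{3.21} $$

where

$$ \alpha^2 = \frac{(\gamma H N)^2}{\left[1 + (\gamma H \Delta z/\sigma_0^2)^2/12\right]} \tag{3.22} $$

and $\Delta z$ is the vertical extent of the Na layer.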
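
The bracketed term $1 + (\gamma H \Delta z/\sigma_0^2)^2/12$ in Equation (3.22) is the average of the altitude-dependent factor $[1 - \gamma H (z - z_0)/\sigma_0^2]^2$ from Equation (3.19) over a layer of vertical extent $\Delta z$ centered on the centroid. The following Python sketch checks that statement numerically. The parameter values and function names are illustrative assumptions, not quantities from this report.

```python
import numpy as np


def numeric_layer_average(a, dz, n=100_000):
    """Midpoint-rule average of [1 - a*u]^2 over u in [-dz/2, +dz/2].

    Here u = z - z0 and a = gamma*H/sigma0^2 (units of 1/km if dz is in km).
    """
    u = (np.arange(n) + 0.5) / n * dz - dz / 2.0
    return float(np.mean((1.0 - a * u) ** 2))


def closed_form_layer_average(a, dz):
    """Closed form 1 + (a*dz)^2/12, the bracket appearing in Equation (3.22)."""
    return 1.0 + (a * dz) ** 2 / 12.0


# Assumed illustrative values: gamma = 1.4, H = 6 km, sigma0 = 4.3 km, dz = 20 km.
a_assumed = 1.4 * 6.0 / 4.3**2
numeric = numeric_layer_average(a_assumed, 20.0)
closed = closed_form_layer_average(a_assumed, 20.0)
```

The linear cross term averages to zero over a symmetric interval, which is why only the quadratic term survives.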




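The horizontal wavenumber spectra can also be related using Equations (3.14), (3.15)
and (3.17). If the Na density spectra are computed at each altitude throughout the layer and then
averaged, the horizontal wind spectrum, $E_x$, and the atmospheric density spectrum, $E_a$, can be
shown to be

$$ E_x(k_x) = \frac{\alpha^2}{\Delta z} \int_{z_0 - \Delta z/2}^{z_0 + \Delta z/2} \frac{\langle |R_s(k_x,z,t)|^2 \rangle}{\Delta x}\, dz \tag{3.23} $$

and

$$ E_a(k_x) = \left[\frac{\gamma - 1}{\gamma H N}\right]^2 \frac{\alpha^2}{\Delta z} \int_{z_0 - \Delta z/2}^{z_0 + \Delta z/2} \frac{\langle |R_s(k_x,z,t)|^2 \rangle}{\Delta x}\, dz \tag{3.24} $$

where $\alpha^2$ is the normalization factor defined in Equation (3.22), $R_s$ is the Fourier transform of $r_s$,

$$ R_s(k_x,z,t) = \int_{x_0 - \Delta x/2}^{x_0 + \Delta x/2} r_s(x,z,t)\, e^{i k_x x}\, dx \tag{3.25} $$

and $\Delta x$ is the horizontal extent of the Na observations. The horizontal wind spectrum and
atmospheric density spectrum can also be calculated for a specific altitude range, i.e., the layer
bottomside (from the bottom edge to the centroid), or the layer topside (from the centroid to the
top edge of the layer). The bottomside horizontal wind spectrum, $E_{bx}$, and the bottomside
atmospheric density spectrum, $E_{ba}$, are

$$ E_{bx}(k_x) = \frac{2\alpha_b^2}{\Delta z} \int_{z_0 - \Delta z/2}^{z_0} \frac{\langle |R_s(k_x,z,t)|^2 \rangle}{\Delta x}\, dz \tag{3.26} $$

$$ E_{ba}(k_x) = \left[\frac{\gamma - 1}{\gamma H N}\right]^2 \frac{2\alpha_b^2}{\Delta z} \int_{z_0 - \Delta z/2}^{z_0} \frac{\langle |R_s(k_x,z,t)|^2 \rangle}{\Delta x}\, dz \tag{3.27} $$

where

$$ \alpha_b^2 = \frac{(\gamma H N)^2}{\left[1 + \gamma H \Delta z/2\sigma_0^2 + (\gamma H \Delta z/\sigma_0^2)^2/12\right]} \tag{3.28} $$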


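The topside horizontal wind spectrum, $E_{tx}$, and the topside atmospheric density spectrum, $E_{ta}$,
are

$$ E_{tx}(k_x) = \frac{2\alpha_t^2}{\Delta z} \int_{z_0}^{z_0 + \Delta z/2} \frac{\langle |R_s(k_x,z,t)|^2 \rangle}{\Delta x}\, dz \tag{3.29} $$

$$ E_{ta}(k_x) = \left[\frac{\gamma - 1}{\gamma H N}\right]^2 \frac{2\alpha_t^2}{\Delta z} \int_{z_0}^{z_0 + \Delta z/2} \frac{\langle |R_s(k_x,z,t)|^2 \rangle}{\Delta x}\, dz \tag{3.30} $$

where

$$ \alpha_t^2 = \frac{(\gamma H N)^2}{\left[1 - \gamma H \Delta z/2\sigma_0^2 + (\gamma H \Delta z/\sigma_0^2)^2/12\right]} \tag{3.31} $$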


If the observation altitude range $\Delta z$ is large compared to the vertical correlation length of the
waves, then the vertical wavenumber spectra are related by the following equations [Gardner and
Senft, 1989].


x^-6xa 


(3.33) 


where

$$ R_s(k_z,x,t) = \int_{z_0 - \Delta z/2}^{z_0 + \Delta z/2} r_s(x,z,t)\, e^{i k_z z}\, dz \tag{3.34} $$


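Quasi-monochromatic gravity waves are frequently observed in the Na density profiles.
For the case of low-frequency monochromatic waves, exact solutions of Equations (3.2) and
(3.3) for $\phi$ and $\theta_z$ were derived by Gardner and Shelton [1985].

$$ \phi = \ln\left[1 + \frac{A e^{\beta z}}{\gamma - 1} \cos(\omega t - \mathbf{k}\cdot\mathbf{r})\right] \tag{3.35} $$

$$ \theta_z = \gamma H \ln\left[1 + \frac{A e^{\beta z}}{\gamma - 1} \cos(\omega t - \mathbf{k}\cdot\mathbf{r})\right] \tag{3.36} $$

where

$A e^{\beta z}$ = wave amplitude

$\beta$ = amplitude growth factor (m^-1)

$\mathbf{k} = k_x\hat{\mathbf{x}} + k_z\hat{\mathbf{z}}$ = wavenumber vector (m^-1)

$k_x$ = horizontal wavenumber (m^-1)

$k_z$ = vertical wavenumber (m^-1)

$\omega$ = wave frequency (s^-1).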
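
To illustrate how the monochromatic solutions (3.35) and (3.36) enter the layer response of Equation (3.1), the Python sketch below perturbs an assumed Gaussian Na layer with a single wave. All numerical defaults (layer shape, scale height, wavelengths, period) and the function names are illustrative assumptions rather than values or code from this report, and the amplitude must be small enough that the argument of the logarithm stays positive.

```python
import numpy as np

GAMMA = 1.4   # ratio of specific heats (assumed)
H_KM = 6.0    # atmospheric scale height in km (assumed)


def unperturbed_layer(z_km, z0_km=92.0, sigma0_km=4.3):
    """Gaussian stand-in for n0(z), normalized to a peak value of 1."""
    return np.exp(-0.5 * ((z_km - z0_km) / sigma0_km) ** 2)


def perturbed_layer(z_km, x_km, t_s, amp, beta_per_km=0.0,
                    lambda_x_km=263.0, lambda_z_km=10.0, period_s=7200.0):
    """Layer response n_s = exp(-phi) * n0(z - theta_z) for one wave."""
    # Phase omega*t - k.r written with wavelengths and period.
    phase = 2.0 * np.pi * (t_s / period_s
                           - x_km / lambda_x_km
                           - z_km / lambda_z_km)
    wave = amp * np.exp(beta_per_km * z_km) / (GAMMA - 1.0) * np.cos(phase)
    phi = np.log1p(wave)            # Equation (3.35)
    theta_z = GAMMA * H_KM * phi    # Equation (3.36)
    return np.exp(-phi) * unperturbed_layer(z_km - theta_z)


z = np.linspace(80.0, 105.0, 251)
quiet = perturbed_layer(z, 0.0, 0.0, amp=0.0)
disturbed = perturbed_layer(z, 0.0, 0.0, amp=0.02)
```

With zero amplitude the function returns the unperturbed profile, which is a quick consistency check on the two expressions.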

The  corresponding  vertical  and  horizontal  winds  generated  by  an  unsaturated  gravity  wave  are 
given  by  [Hines,  1960] 

V  a .  ^  Ae®'  sin(cot  -  kiz)  (3.37) 

Y-1  K 


V  ~ ^  Ae®*  sin((ot  -  k*B)  (3.38) 

Y-1  >.x 

The vertical and horizontal wavelengths are, respectively, $\lambda_z$ and $\lambda_x$. By substituting Equation
(3.35) into (3.14) and evaluating the horizontal wavenumber spectrum given by Equation (3.23)
at the horizontal wavenumber of the wave, it can be shown that


Ex(kx)  X 


cosh(BAz)  - 


2 

2ao 

yHAz 


sinh(BAz) 


(3.39) 


Equation  (3.39)  can  be  used  to  calculate  the  horizontal  wind  velocity  of  monochromatic  gravity 
waves  by  measuring  the  amplitudes  of  the  spectral  peaks  in  the  horizontal  wavenumber 
spectrum. 




3.3  Description  of  the  Experiment 

In early November of 1986, the UIUC lidar system was installed on the Electra aircraft
operated by the NCAR-RAF. The system included a flashlamp-pumped dye laser, a 35 cm
diameter Cassegrain telescope, and associated electronic and optical subsystems. Figure 3.1 is a
photograph of the NCAR Electra, and Figure 3.2 is a photograph of the Electra interior showing
the lidar installation. The major lidar and aircraft parameters were summarized in Table 2.1.
Following the installation, a total of three flights were conducted out of Stapleton International
Airport in Denver from November 13 to 18, 1986. The ground tracks of these flights were
illustrated in Figure 2.4. The flights and system performance were summarized in Table 2.2.
During the three flights, a total of 425 Na density profiles were collected. The integration period
for each profile was 100 s. The horizontal distance corresponding to this integration period
ranged from 11 to 20 km, depending on the ground speed of the aircraft.

The aircraft position and altitude were monitored and recorded during the flights. The
aircraft longitude and latitude were measured with an Inertial Navigation System (INS), and the
altitude was measured with a pressure transducer. The INS was also used to measure the roll
and pitch angles of the aircraft, and flight directions. The resolution of the latitude and longitude
measurements was 0.0014°, and the resolution of the roll and pitch angles, and flight direction
was 0.0028°. Radiosonde data were also collected at 5 different locations near the flight tracks:
Peoria, Illinois (41°N, 90°W), Topeka, Kansas (39°N, 96°W), Denver, Colorado (40°N,
105°W), Salt Lake City, Utah (41°N, 112°W), and Medford, Oregon (42°N, 123°W). The
radiosonde data revealed that the error of the measured pressure altitude due to meteorological
variations was negligible during the flights. The roll angle of the aircraft was usually less than
1°, and the pitch angle was relatively constant at about 3°. Therefore, the laser beam was not
directed vertically, so that the geographical coordinates of the aircraft were different from the
coordinates of the locations where the laser beam propagated through the Na layer. The
geographical coordinates and orientation of the aircraft were used to compute the geographical
coordinates of the Na measurements before the Na data were processed.

Figure 3.2. A photograph of the interior of the NCAR Electra showing the lidar installation.
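
The size of the correction described above can be estimated with simple geometry. The following Python sketch is an illustration only: it assumes a flat-Earth geometry, treats the beam tilt as a single angle from the vertical, and uses a nominal 90 km layer altitude together with the approximately 8 km flight altitude quoted in Section 3.4. The function name is hypothetical.

```python
import math


def beam_offset_km(layer_alt_km, aircraft_alt_km, tilt_deg):
    """Horizontal distance between the aircraft and the point where a beam
    tilted tilt_deg from the vertical crosses the layer (flat-Earth sketch)."""
    return (layer_alt_km - aircraft_alt_km) * math.tan(math.radians(tilt_deg))


# Assumed nominal values: 90 km layer altitude, 8 km flight altitude, 3 degree pitch.
offset_km = beam_offset_km(90.0, 8.0, 3.0)
```

Under these assumptions the offset is a few kilometers, which is smaller than the 11 to 20 km horizontal distance covered during one profile integration but not negligible.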

3.4  Results 

In this section, the data collected over the long baselines of the eastward and westward
flights will be presented. Both flights included one eastbound and one westbound leg. During
the eastward flight, the Na layer appeared to be influenced by quasi-monochromatic waves. To
illustrate the effects of the waves, a sequence of Na density profiles is plotted versus longitude in
Figure 3.3. These profiles were collected during the eastbound leg. The profiles were filtered
vertically with a cutoff of 4 km and horizontally with a cutoff of 70 km. The profiles were then
normalized, so that each had the same column abundance. The wavelike features in the layer
appear to be repetitive. For example, the profiles near 99-100°W are similar to those near
91-92°W. The horizontal separation distance between these two groups of profiles is
approximately 650 km and appears to be related to the horizontal wavelength of a dominant
wave.

The vertical and longitudinal variations of the relative density perturbations also exhibited
wavelike features. Figure 3.4 shows the relative density perturbations computed at each altitude
from 90 to 97 km at increments of 1 km. Vertical and longitudinal progressions of wavelike
features are clearly seen near 99°W and 95°W.

Figure 3.3. Sodium density profiles collected during the eastbound leg of the eastward flight on
November 15, 1986. The profiles were filtered vertically with a cutoff of 4 km and
horizontally with a cutoff of 70 km. The density profiles were also normalized so
that each had the same column abundance and were plotted on a linear scale.
[Vertical axis: altitude (km).]

Figure 3.4. Vertical and longitudinal variations of the relative density perturbations computed
from the Na lidar data collected during the eastbound leg of the eastward flight on
November 15, 1986. Before the perturbations were computed, the density profiles were
filtered vertically with a cutoff of 1 km and horizontally with a cutoff of 70 km.
Then the relative density perturbations were computed and filtered horizontally with
a cutoff of 180 km. [Vertical axis: relative density perturbations.]

Figure 3.5. Horizontal wavenumber spectrum for the eastbound leg of the eastward flight on
November 15, 1986. The data were filtered vertically with a cutoff of 1 km before
the spectrum was computed. The straight line is a linear regression fit which was
used to estimate the spectral slope over horizontal scales from 70 to 700 km.
[Axes: horizontal velocity spectrum ((m/s)^2/(cyc/m)) and atmospheric density
spectrum (m/cyc) versus horizontal wavenumber (cyc/m).]

The horizontal wavenumber spectrum corresponding to the profiles from the eastbound
leg (Figure 3.3) is plotted in Figure 3.5. The profiles were first filtered vertically with a cutoff of
1 km, and then the horizontal wavenumber spectrum was computed at each altitude from the
bottom to the top of the layer at increments corresponding to the vertical resolution of the lidar
(i.e., 150 m). The spectrum of Figure 3.5 is the average of the spectra computed from the
bottom to the top of the layer. The horizontal wavenumber is the inverse of longitudinal distance
along the eastbound leg. In order to estimate the spectral slope, a linear regression fit was
performed over the wavenumber range from 1.43x10^-6 to 1.43x10^-5 m^-1, which corresponds to
horizontal scales ranging from 70 to 700 km. The straight line drawn in Figure 3.5 is the linear
fit, and the slope is approximately -1.41.
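
The slope estimate described above amounts to a straight-line fit in log-log coordinates restricted to a wavenumber band. The Python sketch below shows one way such a fit could be done. It is not the processing code used for this report; the function name is hypothetical, and the synthetic power-law spectrum is only there to exercise the function.

```python
import numpy as np


def spectral_slope(wavenumber, power, k_min, k_max):
    """Least-squares slope of log10(power) versus log10(wavenumber),
    using only samples with k_min <= wavenumber <= k_max."""
    mask = (wavenumber >= k_min) & (wavenumber <= k_max)
    slope, _intercept = np.polyfit(np.log10(wavenumber[mask]),
                                   np.log10(power[mask]), 1)
    return float(slope)


# Synthetic power law with an assumed slope of -1.41 (cyc/m on the abscissa).
k = np.logspace(-6.2, -4.5, 200)
synthetic = 3.0 * k ** -1.41

# Band limits corresponding to horizontal scales from 700 km down to 70 km.
fitted = spectral_slope(k, synthetic, 1.0 / 700e3, 1.0 / 70e3)
```

The band limits are simply the reciprocals of the largest and smallest horizontal scales, i.e. about 1.43x10^-6 and 1.43x10^-5 m^-1.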

The  average  vertical  wavenumber  spectrum  corresponding  to  the  profiles  of  the 
eastbound  leg  is  plotted  in  Figure  3.6.  The  Na  density  profiles  were  first  filtered  horizontally 
with  a  cutoff  of  70  km,  and  then  the  vertical  wavenumber  spectrum  was  computed.  The  shot 
noise  level  was  estimated  and  subtracted  from  the  spectrum,  and  the  results  were  then  plotted  in 
Figure 3.6. The straight line is the linear regression fit. The spectral slope is estimated to be
-2.80 over the wavenumber range from 10^-4 to 5x10^-4 m^-1, which corresponds to vertical scales
ranging from 2 to 10 km.

A  sequence  of  density  profiles  collected  during  the  westbound  leg  of  the  eastward  flight 
is  plotted  versus  longitude  in  Figure  3.7.  The  data  were  processed  in  the  same  manner  as  those 
plotted  in  Figure  3.3.  At  the  flight  altitude  of  approximately  8  km,  the  prevailing  winds  were 
eastward.  The  ground  speed  of  the  aircraft  was  much  slower  during  the  westbound  leg.  The 
ground speeds were approximately 181 m s^-1 eastbound and 121 m s^-1 westbound. Also, the
baseline of the westbound leg was slightly longer, because the westbound flight path extended
past Denver and over the front range of the Rockies. Therefore, about twice as many profiles
were collected during the westbound leg. In Figure 3.7, the last profile near 90°W was the first
observed profile of the westbound leg, because the aircraft was proceeding westward. Note that
the features of the profiles observed on the westbound leg near 105°W (Figure 3.7) are quite
similar to those observed on the eastbound leg near 99°W (Figure 3.3). These features appear to
be propagating westward.

The westward propagating features are also seen in the vertical and longitudinal variations
of the relative density perturbations computed from the Na profiles measured on the westbound
leg and plotted in Figure 3.8. The maxima of the perturbations near 96°W in Figure 3.4 appear
to have propagated westward to 98°W in Figure 3.8.


Figure 3.6. Vertical wavenumber spectrum for the eastbound leg of the eastward flight on
November 15, 1986. The data were filtered horizontally with a cutoff of 70 km
before the spectrum was computed. Then the shot noise level was estimated and
subtracted from the spectrum before the spectral slope was estimated. The straight
line is a linear regression fit which was used to estimate the spectral slope over
vertical scales from 2 to 10 km.

Figure 3.7. Sodium density profiles collected during the westbound leg of the eastward flight
on November 15-16, 1986. The profiles were processed in the same manner as
those of the eastbound leg of the eastward flight plotted in Figure 3.3.
[Axes: altitude (km) versus longitude (°W); panel time labels 2355 MST and 0042 MST.]

Figure 3.8. Vertical and longitudinal variations of the relative density perturbations computed
from the Na lidar data collected during the westbound leg of the eastward flight on
November 15-16, 1986. To compute these relative density perturbations, the Na
profiles were first filtered vertically with a cutoff of 1 km and horizontally with a
cutoff of 70 km. Then the relative density perturbations were computed and filtered
horizontally with a cutoff of 233 km.
[Axes: relative density perturbations versus longitude (°W).]



The horizontal wavenumber spectrum corresponding to the profiles of the westbound leg
is plotted in Figure 3.9. The profiles were again vertically filtered with a cutoff of 1 km before
the spectrum was computed. Comparison of the spectra of Figure 3.9 and Figure 3.5 reveals
two main features. First, the amplitudes of the spectral peaks for wavenumbers lower than
7x10^-6 m^-1 are in general larger for the eastbound observations (Figure 3.5) than for the
westbound observations (Figure 3.9). On the other hand, the amplitudes of the peaks for
wavenumbers higher than 7x10^-6 m^-1 are smaller for the eastbound observations (Figure 3.5)
than for the westbound observations (Figure 3.9). Second, the spectral slope for the westbound
observations is estimated to be -1.12 at horizontal scales from 70 to 700 km, which is much
shallower than the slope for the eastbound observations. The differences in the spectral amplitudes
and slopes could be related to Doppler effects. For example, the spectral energy
corresponding to large scale westward propagating waves would be Doppler shifted to lower
wavenumbers for the westbound observations in Figure 3.9, while energy corresponding to
small scale eastward propagating waves could be shifted to higher wavenumbers for the
westbound observations. The result would be a shallower spectral slope for the westbound
observations compared to the eastbound observations.

The  westward  propagating  waves  are  also  seen  in  the  wavelike  variations  of  the  centroid 
height of the Na layer plotted in Figure 3.10. Notice that the centroid maxima and minima
measured  on  the  westbound  leg  are  separated  by  larger  distances  than  those  measured  on  the 
eastbound  leg.  It  appears  that  the  waves  which  induced  the  centroid  height  perturbations  were 
propagating  westward.  In  fact,  the  centroid  maxima  and  minima  measured  on  the  eastbound  leg 
can  be  matched  with  those  measured  on  the  westbound  leg.  The  corresponding  maxima  and 
minima  are  marked  in  Figure  3.10.  From  the  measurement  times  and  horizontal  separation 
distances  between  the  corresponding  maxima  (or  minima),  westward  velocities  were  computed 
and  summarized  in  Table  3.1. 

Figure 3.9. Horizontal wavenumber spectrum for the westbound leg of the eastward flight on
November 15-16, 1986. The data were processed in the same manner as the data
of the eastbound leg of the eastward flight.
[Axes: horizontal velocity spectrum ((m/s)^2/(cyc/m)) and atmospheric density
spectrum (m/cyc) versus horizontal wavenumber (cyc/m).]

Figure 3.10. The longitudinal variations of the centroid height of the Na layer observed during
the eastward flight on November 15-16, 1986.
[Axes: centroid height (km) versus longitude (°W).]

Table 3.1. Horizontal Velocities of the Maxima and Minima of the
Centroid Height of the Na Layer Observed During
the Eastward Flight on November 15-16, 1986

                  Eastbound Leg         Westbound Leg
Label  Feature    Longitude  Time       Longitude  Time       Westward
                  (°W)       (MST)      (°W)       (MST)      Velocity (m s^-1)
A      Maximum     99.53     22:38      104.60     02:46      29.6
B      Minimum     97.67     22:53      102.52     02:22      33.6
C      Maximum     96.27     23:04      101.88     02:14      42.9
D      Minimum     95.27     23:12       99.92     01:51      42.6
E      Maximum     94.20     23:20       97.52     01:22      39.3
F      Minimum     93.20     23:27       95.64     01:01      37.6
G      Maximum     92.20     23:35       93.16     00:32      24.5

It appears that at least two different waves were dominating the Na layer during the
flight. One wave appeared to have a much shorter zonal wavelength than the other. The zonal
wavelength of the shorter wave is related to the separation distance between two centroid
maxima, C and E, which was approximately 180 km on the eastbound leg and 380 km on the
westbound leg. The wavelength of the longer wave is related to the distance between centroid
maxima, A and G, which was 640 km on the eastbound leg and about 1000 km on the
westbound leg. The horizontal wavenumber spectra for the eastbound and westbound
observations exhibited spectral peaks corresponding to these two waves. The shorter
wavelength wave corresponds to spectral peaks C in Figures 3.5 and 3.9, and the longer
wavelength wave corresponds to spectral peak A in Figure 3.5. The spectral peak for the longer
wavelength wave could not be seen in the spectrum for the westbound observations, because the
Doppler shifted zonal wavelength of this wave was too close to the flight leg distance of 1295
km. However, the spectral amplitudes at the wavenumbers near (1000 km)^-1 appear to be much
larger in Figure 3.9 for the westbound leg compared to the eastbound leg.

There  are  four  distina  spectral  peaks,  A  through  D,  in  the  horizontal  wavenumber 
spectrum  for  the  eastbound  observations  (Figure  3.5),  and  three  distinct  peaks,  B  through  D,  in 
the  horizontal  wavenumber  spectrum  for  the  westbound  observations  (Figure  3.9).  Spectral 
peaks  B  and  D  in  Figure  3.5  appear  to  be  sidelobes  of  spectral  peaks  A  and  C,  respectively. 
Similarly,  spectral  peak  D  in  Figure  3.9  appears  to  be  a  sidelobe  of  spectral  peak  C,  and  spectral 
peak  B  also  appears  to  be  a  sidelobe  which  corresponds  to  the  longer  wavelength  wave.  The 
wavenumber  separation  between  the  main  spectral  peak  and  the  Hrst  sidelobe  of  a  monochromatic 
wave  is  related  to  the  horizontal  extent  of  the  observations  by 

Ak=—  (3.40) 

Ax 

where Δk is the wavenumber separation between the main peak and the first sidelobe.

The horizontal extent, Δx, for the eastbound observations was 1060 km, and that for the
westbound observations was 1300 km. By using Equation (3.40), the wavenumber separation
between the main peak and the first sidelobe is calculated to be 1.4 × 10^-6 m^-1 for the eastbound
observations, and 1.2 × 10^-6 m^-1 for the westbound observations. The wavenumber separation
between spectral peaks A and B in the spectrum for the eastbound observations (Figure 3.5) is
1.4 × 10^-6 m^-1, and the separation between spectral peaks C and D is again 1.4 × 10^-6 m^-1. The
wavenumber separation between spectral peaks C and D in the spectrum for the westbound
observations (Figure 3.9) is 10^-6 m^-1.
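As a quick numerical check of the sidelobe spacing, the following sketch assumes that Equation (3.40) has the form Δk = 3/(2Δx), which is the form that reproduces the separations quoted above. The function name is illustrative only.

```python
def sidelobe_separation(extent_m):
    """Main-peak to first-sidelobe spacing in cyc/m, assuming dk = 3 / (2 * dx)."""
    return 1.5 / extent_m

# Horizontal extents of the two legs, taken from the text and converted to metres.
dk_east = sidelobe_separation(1060e3)
dk_west = sidelobe_separation(1300e3)

print(f"eastbound: {dk_east:.2e} m^-1")
print(f"westbound: {dk_west:.2e} m^-1")
```

Both values fall close to the calculated separations of 1.4 × 10^-6 m^-1 and 1.2 × 10^-6 m^-1 given in the text.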

The  observed  Doppler  shifted  wavelengths  of  the  two  waves  are  related  to  the  intrinsic 
zonal  wavelengths  by  the  following  equation. 


\lambda_{in} = \lambda_{ob} \left( 1 - \frac{V_{ob}}{V_a} \right) \qquad (3.41)


where λ_in = intrinsic wavelength along the flight path,

λ_ob = observed wavelength along the flight path,

V_ob = V_p + V_b = observed phase velocity along the flight path,

V_p = intrinsic phase velocity along the flight path,

V_b = background atmospheric wind velocity along the flight path at the altitudes of the
Na layer, and
V_a = aircraft velocity.

The background wind velocity was measured at Platteville, Colorado with an ST radar during the
period from November 4 to 20, 1986. The average zonal wind velocity was about 5 m s^-1
eastward over the altitude range of the Na layer (see Section 4.3).
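The Doppler relation can be illustrated with a short forward calculation. This is a sketch under the assumption that the relation is equivalent to λ_ob = λ_in / (1 - V_ob/V_a), with all velocities taken positive eastward. The leg velocities (181 and 121 m s^-1) are the averages listed in Table 3.3, and the wave values are those derived later in this section for the shorter wavelength wave. The function name is hypothetical.

```python
def observed_wavelength(lam_in, v_ob, v_a):
    """Along-track wavelength seen from the aircraft (same units as lam_in).

    Assumes lam_ob = lam_in / (1 - v_ob / v_a), velocities positive eastward.
    """
    return lam_in / (1.0 - v_ob / v_a)

LAM_IN = 263e3   # intrinsic zonal wavelength (m), from the text
V_OB = -38.0     # observed phase velocity (m/s); westward is negative

lam_east = observed_wavelength(LAM_IN, V_OB, +181.0)  # aircraft flying east
lam_west = observed_wavelength(LAM_IN, V_OB, -121.0)  # aircraft flying west

print(f"eastbound leg: {lam_east / 1e3:.0f} km")
print(f"westbound leg: {lam_west / 1e3:.0f} km")
```

Flying against the wave compresses the observed wavelength and flying with it stretches the wavelength, which matches the observed values of 217 km and 382 km to within rounding.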

Because observations were made over both the eastbound and westbound flight legs, and
the aircraft velocity is known, Equation (3.41) can be solved for both the intrinsic wavelength
and observed phase velocity along the flight path. The intrinsic wavelength along the flight path
is


\lambda_{in} = \frac{\lambda_{ob}^{E}\,\lambda_{ob}^{W}\,(V_a^{E} - V_a^{W})}{\lambda_{ob}^{W} V_a^{E} - \lambda_{ob}^{E} V_a^{W}} \qquad (3.42)

where λ_ob^E = wavelength observed during the eastbound flight leg,

λ_ob^W = wavelength observed during the westbound flight leg,

V_a^E = average aircraft velocity during the eastbound flight leg, and
V_a^W = average aircraft velocity during the westbound flight leg.
Similarly, the observed phase velocity along the flight path is

V_{ob} = \frac{V_a^{E}\,V_a^{W}\,(\lambda_{ob}^{W} - \lambda_{ob}^{E})}{\lambda_{ob}^{W} V_a^{E} - \lambda_{ob}^{E} V_a^{W}} \qquad (3.43)


By using the two wavelengths of spectral peaks C from Figures 3.5 and 3.9, the intrinsic zonal
wavelength of the shorter wavelength wave is computed to be 263 km. The observed phase
velocity is approximately 38 m s^-1 westward, which is quite comparable to the westward
velocities of centroid maxima, C and E, and minima, D and F, listed in Table 3.1. By subtracting
the background wind velocity from the observed phase velocity, the intrinsic zonal phase velocity
is calculated to be 43 m s^-1 westward. The intrinsic wave period is calculated to be 102 min.
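The two-leg solution can be sketched numerically as follows. The expressions in the code are the signed-velocity forms that are consistent with the quoted results (velocities positive eastward, so the westbound aircraft velocity is negative). That sign convention and the function names are assumptions of this sketch. The aircraft velocities are the leg averages from Table 3.3.

```python
def intrinsic_wavelength(lam_e, lam_w, v_e, v_w):
    """Intrinsic along-track wavelength from the two observed wavelengths."""
    return lam_e * lam_w * (v_e - v_w) / (lam_w * v_e - lam_e * v_w)

def observed_phase_velocity(lam_e, lam_w, v_e, v_w):
    """Ground-relative phase velocity along the flight path."""
    return v_e * v_w * (lam_w - lam_e) / (lam_w * v_e - lam_e * v_w)

V_E, V_W = 181.0, -121.0   # aircraft velocities (m/s), eastward positive
V_B = 5.0                  # background zonal wind (m/s), eastward

# Spectral peak C: 217 km observed eastbound, 382 km observed westbound.
lam_in = intrinsic_wavelength(217e3, 382e3, V_E, V_W)
v_ob = observed_phase_velocity(217e3, 382e3, V_E, V_W)
v_in = v_ob - V_B                        # intrinsic phase velocity
period_min = lam_in / abs(v_in) / 60.0   # intrinsic period in minutes

print(f"intrinsic wavelength: {lam_in / 1e3:.0f} km")
print(f"observed phase velocity: {v_ob:.0f} m/s (negative = westward)")
print(f"intrinsic phase velocity: {v_in:.0f} m/s")
print(f"intrinsic period: {period_min:.0f} min")
```

Under these assumptions the sketch returns values that agree with the 263 km, 38 m s^-1, 43 m s^-1, and 102 min quoted in the text to within rounding.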

From ground-based Na lidar observations obtained at Urbana, Illinois, Gardner and
Voelz  [1987]  found  surprisingly  systematic  relationships  between  horizontal  and  vertical 
wavelengths  and  the  observed  periods  of  quasi-monochromatic  waves.  According  to  the 
Gardner  and  Voelz  [1987]  empirical  relationships,  an  observed  period  of  102  min  corresponds  to 
a  horizontal  wavelength  of  101  km  and  a  vertical  wavelength  of  4.6  km.  The  zonal  wavelength 
will  be  larger  than  the  intrinsic  horizontal  wavelength  if  the  wave  is  not  propagating  zonally.  The 
relationship  between  the  intrinsic  horizontal  wavelength  and  zonal  wavelength  is 


\lambda_{zonal} = \frac{\lambda_h}{\cos\alpha} \qquad (3.44)


where α is the angle between the wave propagation direction and the east-west line. By
assuming the intrinsic horizontal wavelength is approximately equal to the value predicted by the
Gardner and Voelz [1987] empirical relationship, the angle, α, is calculated from Equation
(3.44) to be approximately 70°. Thus the wave appears to be propagating at an approximate
azimuth angle of either 200° or 340°.
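The angle estimate can be reproduced with a few lines of arithmetic. This sketch assumes that the zonal wavelength equals the intrinsic horizontal wavelength divided by cos α, that azimuth is measured clockwise from north, and that the wave has a westward component (azimuth 270° for pure westward propagation). The variable names are illustrative.

```python
import math

LAM_H = 101.0      # horizontal wavelength (km) from the empirical relationship
LAM_ZONAL = 263.0  # intrinsic zonal wavelength (km) from the airborne data

# Angle between the propagation direction and the east-west line.
alpha = math.degrees(math.acos(LAM_H / LAM_ZONAL))

# A westward-propagating wave tilted by alpha to either side of due west.
azimuth_south_of_west = 270.0 - alpha
azimuth_north_of_west = 270.0 + alpha

print(f"alpha = {alpha:.1f} deg")
print(f"azimuths = {azimuth_south_of_west:.0f} deg or {azimuth_north_of_west:.0f} deg")
```

The result is a little under 70°, which the text rounds to 70° and to azimuths of 200° and 340°.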


The  parameters  of  the  longer  wavelength  wave  can  be  also  estimated  by  using  Equations 
(3.42)  and  (3.43).  The  observed  wavelength  for  the  eastbound  observations  was  660  km,  which 
is  the  wavelength  of  spectral  peak  A  in  the  horizontal  wavenumber  spectrum  (Figure  3.5).  For 
the  westbound  observations,  the  wavelength  is  estimated  from  the  centroid  height  data  to  be 
approximately  1030  km.  By  using  these  two  horizontal  wavelengths,  the  intrinsic  zonal 
wavelength  is  calculated  to  be  772  km,  and  the  observed  phase  velocity  is  approximately  30 
m s^-1 westward. This velocity is comparable to the westward velocities of centroid maxima A
and G and minimum B (see Table 3.1). The intrinsic period is computed to be 363 min or 6.1
hours.  This  period  of  6.1  hours  strongly  suggests  that  the  wave  is  a  westward  propagating  tide. 
The  vertical  wavelength  of  this  wave  is  related  to  the  intrinsic  horizontal  wavelength  by  the 
gravity  wave  dispersion  relation. 


\lambda_z \approx \frac{T_B}{T_{in}}\,\lambda_h \qquad (3.45)


where T_B = Brunt-Väisälä period (5 min),

T_in = intrinsic wave period.

The  intrinsic  horizontal  wavelength  of  the  wave  was  not  measured  during  the  flight.  However, 
by  using  the  intrinsic  zonal  wavelength  of  772  km  and  the  intrinsic  period  of  6.1  hours,  the 
vertical  wavelength  is  calculated  to  be  10.6  km.  From  the  systematic  relationships  obtained  from 
the  ground-based  Na  lidar  observations  by  Gardner  and  Voelz  [1987],  an  observed  period  of  6.1 
hours  would  correspond  to  a  horizontal  wavelength  of  734  km  and  a  vertical  wavelength  of  9.8 
km. These horizontal and vertical wavelengths are quite comparable to those calculated from the
airborne data, and provide further support for the assumption that this wave was propagating
approximately due west. The theoretical results of Gardner and Shelton [1985] and Gardner et
al. [1986] indicate that the Na layer centroid height perturbations are largest for waves with
vertical wavelengths between 10 and 20 km and for vertical wavelengths greater than 35 km.
The peak-to-peak centroid height variations caused by the 6.1-hour period wave are almost 1 km
for the westbound flight leg. The Platteville, Colorado radar also measured a strong 6-hour
period wind oscillation near 90 km during the period November 4 to 20, 1986 (see Section 4.3).
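The vertical wavelength quoted for the 6.1-hour wave can be checked with the sketch below. It assumes the long-period approximation of the gravity wave dispersion relation, λ_z ≈ λ_h T_B / T_in, and compares it with the non-rotating form λ_z = λ_h / sqrt((T_in/T_B)^2 - 1). Which form the original equation takes is an assumption here, and the function names are hypothetical.

```python
import math

def vertical_wavelength_approx(lam_h, t_b, t_in):
    """Long-period approximation, valid when t_in is much larger than t_b."""
    return lam_h * t_b / t_in

def vertical_wavelength_full(lam_h, t_b, t_in):
    """Non-rotating dispersion relation without the long-period approximation."""
    return lam_h / math.sqrt((t_in / t_b) ** 2 - 1.0)

# Values from the text: 772 km zonal wavelength, 5 min buoyancy period,
# 363 min intrinsic period.
lam_z_approx = vertical_wavelength_approx(772.0, 5.0, 363.0)
lam_z_full = vertical_wavelength_full(772.0, 5.0, 363.0)

print(f"approximate: {lam_z_approx:.2f} km")
print(f"full:        {lam_z_full:.2f} km")
```

For a period this long the two forms are practically indistinguishable, and both agree with the 10.6 km given in the text.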

The wave parameters including horizontal wind amplitudes corresponding to spectral
peaks A and C are summarized in Table 3.2. The horizontal wavelengths of the waves observed
with the ground-based Na lidar [Gardner and Voelz, 1987] are plotted versus period in Figure
3.11. The intrinsic zonal wavelengths and intrinsic periods measured during the eastward flight
are also plotted in Figure 3.11. The straight line is a maximum likelihood linear regression fit
which was used to estimate the slope for the ground-based observations. As discussed earlier,
the intrinsic zonal wavelength of the 6.1-hour period wave appears to be quite comparable to the
horizontal wavelengths estimated from the ground-based observations for the same period. The
wave corresponding to spectral peak C appears to have propagated at an azimuth angle of
approximately 200° or 340°. The horizontal distance between the location of the airborne lidar
observations and the location of the source of a wave can be estimated roughly by multiplying the
height of the Na layer by the ratio of the intrinsic horizontal wavelength to the vertical wavelength
of the observed wave. For the wave corresponding to spectral peak C, the average horizontal
distance to the tropospheric source was computed to be about 2000 km. Interestingly, during the
period from November 13 to 19, a strong meteorological low pressure system developed over
northern Hudson Bay, Canada. The azimuth angle of a line drawn from northern Hudson Bay to
the average horizontal location of the flight track is about 200°, and the horizontal distance is
about 2500 km. Although it is speculative, this large stationary low pressure system might be
responsible for the wave corresponding to spectral peak C.
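The source-distance estimate described above amounts to a single multiplication. The sketch below assumes a nominal Na layer height of 90 km, which is not stated explicitly in this paragraph, and uses the horizontal and vertical wavelengths of 101 km and 4.6 km quoted earlier for the wave at spectral peak C.

```python
def source_distance_km(layer_height_km, lam_h_km, lam_z_km):
    """Rough horizontal distance to the source: height times lam_h / lam_z."""
    return layer_height_km * lam_h_km / lam_z_km

NA_LAYER_HEIGHT_KM = 90.0  # assumed nominal height of the Na layer

distance = source_distance_km(NA_LAYER_HEIGHT_KM, 101.0, 4.6)
print(f"estimated distance to source: {distance:.0f} km")
```

With these inputs the estimate comes out just under 2000 km, in line with the value given in the text.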

The kinetic energy distribution for the waves observed with the airborne Na lidar is
plotted versus temporal frequency in Figure 3.12 along with the ground-based data reported by
Gardner and Voelz [1987]. The kinetic energy for the airborne measurements was calculated by
using Equation (3.39) and assuming that QAz < 1 for each wave. The kinetic energies of the
waves observed with the airborne lidar are slightly lower than those observed with the
ground-based lidar.

Table 3.2. Monochromatic Wave Parameters Estimated
from the Horizontal Wavenumber Spectra for the
Eastward Flight on November 15-16, 1986

Spectral Peak                                            A         C

Observed Zonal Wavelength
  Eastbound Observations (km)                          660       217
  Westbound Observations (km)                         1030^a     382
Intrinsic Zonal Wavelength (km)                        772       263
Observed Zonal Phase Velocity (m s^-1 westward)         30        38
Intrinsic Zonal Phase Velocity (m s^-1 westward)^b      35        43
Intrinsic Period (min)                                 363       102
Horizontal Wind Amplitude
  Eastbound Observations (m s^-1)                     12.7       6.1
  Westbound Observations (m s^-1)                       NA       7.5

^a Estimated from the centroid height variations measured on the westbound leg.
^b Calculated assuming background wind of 5 m s^-1 eastward. This value was the measured
average zonal wind at Platteville, Colorado during the period from November 4 - 20, 1986.

[Figure 3.11. Horizontal wavelength (km) versus period. Regression fit: slope = 1.57, coefficient = 0.069.]

[Figure 3.12. Kinetic energy (m^2 s^-2) versus temporal frequency.]

The  average  vertical  wavenumber  spectrum  for  the  westbound  leg  is  plotted  in 
Figure 3.13. The slope is approximately -2.68 at vertical scales from 2 to 10 km, and is
shallower  than  the  slope  of  the  vertical  wavenumber  spectrum  measured  on  the  eastbound  leg. 
The  vertical  wavenumber  spectrum  for  the  westbound  leg  was  larger  in  amplitude  than  that  for 
the  eastbound  leg  for  most  wavenumbers.  The  vertical  wavelength  of  the  dominant  wave  in  the 
spectrum  is  about  9.6  km,  which  appears  to  be  the  vertical  wavelength  of  the  wave  with  the 
period  of  6.1  hours. 
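A spectral slope of this kind is commonly estimated as the gradient of a straight-line fit in log-log coordinates over the chosen band of scales. The sketch below uses ordinary least squares as a stand-in, since the fitting method used for the spectra is not stated in this passage, and applies it to a synthetic power-law spectrum rather than to the flight data. The function name and the test spectrum are illustrative.

```python
import math

def loglog_slope(wavenumbers, spectrum, k_min, k_max):
    """Least-squares slope of log10(spectrum) versus log10(k) for k_min <= k <= k_max."""
    pts = [(math.log10(k), math.log10(s))
           for k, s in zip(wavenumbers, spectrum)
           if k_min <= k <= k_max]
    n = len(pts)
    mean_x = sum(x for x, _ in pts) / n
    mean_y = sum(y for _, y in pts) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in pts)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pts)
    return sxy / sxx

# Synthetic spectrum proportional to k^-2.68 (not measured data).
ks = [1e-4 * 1.1 ** i for i in range(20)]   # wavenumbers in cyc/m
spec = [k ** -2.68 for k in ks]

# Vertical scales of 2 to 10 km correspond to 1e-4 to 5e-4 cyc/m.
slope = loglog_slope(ks, spec, 1e-4, 5e-4)
print(f"fitted slope: {slope:.2f}")
```

Restricting the fit to a band matters because, as noted in the text, the spectral amplitudes at the largest scales are limited by the length of each flight leg.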

Figures  3.14  and  3.15  show,  respectively,  the  horizontal  and  vertical  wavenumber 
spectra  computed  from  the  data  collected  during  the  eastbound  and  westbound  legs  of  the 
westward  flight.  The  difference  in  the  horizontal  wavenumber  spectra  plotted  in  Figures  3.14  a 
and  b  may  be  caused  by  Doppler  effects  and  localized  gravity  wave  packets.  The  vertical 
wavenumber  spectra  for  both  flight  legs  are  plotted  in  Figures  3.15  a  and  b.  The  vertical 
wavenumber  spectrum  for  the  westbound  leg  was  smaller  in  amplitude  for  most  wavenumbers, 
while  the  slope  for  the  westbound  leg  was  steeper.  The  vertical  wavelengths  of  the  dominant 
waves  in  both  vertical  wavenumber  spectra  were  again  9.6  km. 

In Figure 3.16, the rms horizontal wind velocities measured on both eastward and
westward flights are plotted. During the eastward flight, the wind amplitudes of the velocity
perturbations were smaller on the eastbound leg compared to amplitudes measured on the
westbound leg. However, during the westward flight the wind amplitudes were larger on the
eastbound leg. These changes of the amplitudes could be caused by diurnal variations in wind
activity and the differences in the measurement times on the two flights. At Urbana, Illinois, the
ground-based lidar observations have revealed that the rms horizontal wind velocity typically
increases during the night toward the early morning. Therefore, this temporal change could be
responsible for the changes of the rms wind velocities observed during the flights. The
eastbound leg of the eastward flight and the westbound leg of the westward flight were both
conducted during the late evening. Another feature of Figure 3.16 is that the rms wind velocity
increased in general with longitude from the Pacific Coast to the Great Plains.

Figure 3.14 b). Horizontal wavenumber spectrum for the westbound leg of the westward flight
on November 17-18, 1986.

Figure 3.15 a). Vertical wavenumber spectrum for the eastbound leg of the westward flight on
November 17-18, 1986.

Figure 3.15 b). Vertical wavenumber spectrum for the westbound leg of the westward flight on
November 17-18, 1986.

Figure 3.16. The rms horizontal wind velocities inferred from the airborne Na lidar data
collected during the eastward (November 15-16, 1986) and westward (November
17-18, 1986) flights. Axes: longitude (°W); rms horizontal wind velocity (m s^-1); rms
atmospheric density perturbations (%).

In Figure 3.17, the rms horizontal wind velocities measured with the UIUC ground-based
lidar at four different locations along with the velocities measured during the airborne
observations are plotted over a longitude range of 70° to 160°. The four locations include
Mauna Kea Observatory (MKO), Hawaii (20°N, 155°W), NCAR in Broomfield, Colorado
(40°N, 105°W), UIUC, Illinois (40°N, 88°W), and Goddard Space Flight Center (GSFC),
Maryland (39°N, 78°W). The measurements at Mauna Kea Observatory were obtained on three
different nights, January 19, 20, and 21, 1987. The three dots near the longitude of 155°W in
Figure 3.17 indicate the average values of the rms horizontal wind velocities measured on the
three nights, and the lines indicate the ranges of the velocities measured on each night. The
measurements at Broomfield were obtained on the nights of November 12 and 19, 1986. The
dots near the longitude of 105°W again indicate the average values measured on the two nights,
and the lines indicate the ranges of the measured values. The measurements at the Goddard
Space Flight Center were obtained on the nights of October 16, 19, and 21, 1981. The dots and
lines near the longitude of 78°W indicate the average values and the ranges of the measured
values, respectively. The dot near the longitude of 88°W represents the measurements at UIUC
where the ground-based lidar observations have been obtained for the last 10 years. The rms
horizontal winds measured at UIUC exhibit a strong semi-annual oscillation along with a weak
annual oscillation [Gardner and Senft, 1989]. A regression fit was performed to estimate the
amplitudes and phases of both the semi-annual and annual oscillations for the data obtained from
1984 to 1986. The dot in Figure 3.17 near 88°W represents the value for the fit corresponding to
the middle of November. The error bar indicates the rms difference between the fitted value and
the measured values. The magnitudes of the rms horizontal wind velocities measured in Hawaii
are quite comparable to those measured in Colorado, Illinois, and Maryland, and during the
airborne observations. This appears to indicate that the magnitudes of horizontal winds over the
Pacific Ocean are comparable to those over the land masses. However, it should be noted that
the observations in Hawaii were made in January while all the other observations were obtained
in October or November.

[Figure 3.17. Rms horizontal wind velocities versus longitude; axis label: rms atmospheric
density perturbations (%).]

The major characteristics of the vertical and horizontal wavenumber spectra for both
eastward and westward flights are summarized in Table 3.3. Because of Doppler effects, the
slopes and amplitudes of the horizontal wavenumber spectra were quite different depending on
flight directions. The changes of the spectra could be also related to the changes in the rms
horizontal wind velocities; the spectral slopes were steeper when the wind velocities were
smaller. The average slope of the horizontal wavenumber spectra was -1.25. The spectral
slopes computed for only the bottomside of the layer were consistently steeper than those
computed for the topside. This is illustrated in Figure 3.18 where the bottomside and topside
spectra for the eastbound and westbound legs of the eastward flight, and the eastbound and
westbound legs of the westward flight are plotted. Notice that the two quasi-monochromatic
waves corresponding to spectral peaks A and C in Figure 3.5 are dominant only in the
bottomside spectra plotted in Figure 3.18 a and b. The topside spectral amplitudes are larger,
which indicates that the wave amplitudes were growing with altitude. The shallower slope on the
topside may be the result of saturation affecting the larger scale waves. The average slope for the
bottomside spectra for all four flight legs was -1.51 and the average slope for the topside spectra
was -0.97. The changes of the slopes and amplitudes of the vertical wavenumber spectra appear
to be related to the rms horizontal wind velocities. For example, during the eastward flight the
spectral slope was steeper, the spectral amplitude was smaller, and the velocities were smaller on
the eastbound leg compared to the westbound leg. The average slope of the vertical wavenumber
spectra for all flights was -2.67.




Table 3.3. Summary of the Horizontal and Vertical Wavenumber Spectra
Parameters Measured on the Eastward and Westward Flights

Flight                                    Eastward                 Westward
Direction of Flight                  Eastbound   Westbound    Eastbound   Westbound

Date                                 Nov 15      Nov 15-16    Nov 18      Nov 17-18
Time (MST)                           2210-2349   2355-0255    0130-0429   2142-0116
Flight Duration (hours)              1.65        3.00         2.98        3.57
Flight Distance (km)                 1062        1295         1933        1722
Aircraft Velocity^a (m s^-1)         181±18      121±18       181±19      135±20

Vertical Wavenumber Spectrum
  Slope^b                            -2.80       -2.68        -2.49       -2.70
  Altitude Range (km)                82-101      82-102       81-102      83-99

Horizontal Wavenumber Spectrum
  Slope^c                            -1.41       -1.12        -1.03       -1.43
  Altitude Range (km)                82-101      82-102       81-102      83-99

Bottomside Horizontal Wavenumber Spectrum
  Slope^c                            -1.66       -1.29        -1.41       -1.69
  Altitude Range (km)                82-90       82-90        81-90       83-90

Topside Horizontal Wavenumber Spectrum
  Slope^c                            -1.09       -1.01        -0.71       -1.05
  Altitude Range (km)                90-101      90-102       90-102      90-99

^a Longitudinal component of the ground velocity of the aircraft.
^b Spectral slopes computed over vertical scales ranging from 2 to 10 km.
^c Spectral slopes computed over horizontal scales ranging from 70 to 700 km.
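The average slopes quoted in the text follow directly from the four per-leg values in Table 3.3. The short sketch below simply takes the arithmetic mean of each row. The dictionary keys are labels chosen for this example.

```python
from statistics import mean

# Per-leg slopes from Table 3.3, in the order: eastward flight eastbound and
# westbound legs, then westward flight eastbound and westbound legs.
slopes = {
    "vertical":   [-2.80, -2.68, -2.49, -2.70],
    "horizontal": [-1.41, -1.12, -1.03, -1.43],
    "bottomside": [-1.66, -1.29, -1.41, -1.69],
    "topside":    [-1.09, -1.01, -0.71, -1.05],
}

averages = {name: mean(values) for name, values in slopes.items()}
for name, value in averages.items():
    print(f"{name:10s} {value:.3f}")
```

The means agree with the reported averages of -2.67, -1.25, -1.51, and -0.97 to within rounding in the second decimal place.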


Figure 3.18 a). Horizontal wavenumber spectra computed for the bottomside and topside of the
Na layer measured during the eastbound leg of the eastward flight on November
15-16, 1986.

Figure 3.18 b). Horizontal wavenumber spectra computed for the bottomside and topside of the
Na layer measured during the westbound leg of the eastward flight on
November 15-16, 1986.

Figure 3.18 d). Horizontal wavenumber spectra computed for the bottomside and topside of the
Na layer measured during the westbound leg of the westward flight on
November 17-18, 1986.

[Figure 3.18 axes: horizontal wavenumber (cyc/m); horizontal velocity spectrum; atmospheric
density spectrum (m/cyc).]

3.5  Discussion 

The mechanisms responsible for the observed density and velocity perturbations in the
troposphere, stratosphere, and mesosphere have been the subject of much speculation. Two
quite different mechanisms have been suggested to explain horizontal wavenumber spectra of
density and velocity perturbations. The two mechanisms are quasi two-dimensional turbulence
and internal gravity waves. Both mechanisms predict a k_x^{-5/3} dependence in horizontal
wavenumber spectra [Gage and Nastrom, 1985; 1986]. The quasi two-dimensional turbulence
was suggested by Gage and Nastrom [1985; 1986] for the horizontal wavenumber spectra of
meridional and zonal velocities obtained from the Global Atmospheric Sampling Program
(GASP). The GASP data were collected with instrumentation on Boeing 747 aircraft at altitudes
ranging from 9 to 14 km. The slopes of both zonal and meridional wavenumber spectra were
approximately -3 at horizontal scales from 700 km to 10000 km, and -5/3 at scales from 3 km to
500 km. When Gage and Nastrom applied the Taylor transformation to temporal frequency
spectra, they found that the transformed frequency spectra agreed with the GASP horizontal
wavenumber spectra in both amplitude and shape. The temporal frequency spectra were
computed from the data obtained at altitudes near 86 km with an ST/MST radar. Hence, Gage
and Nastrom suggested that the quasi two-dimensional turbulence was responsible for the
spectra.
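The Taylor transformation mentioned above maps a temporal frequency spectrum onto a horizontal wavenumber spectrum by assuming that the structure is advected past the sensor by a mean wind. The sketch below shows the usual form, k = f / U with the spectral density scaled by U so that the variance is conserved. The mean wind of 20 m s^-1 and the sample spectrum are hypothetical values chosen only for illustration, and the function name is not taken from the cited work.

```python
import math

def taylor_transform(freqs_hz, psd_per_hz, mean_wind):
    """Convert a frequency spectrum to a wavenumber spectrum (frozen-field assumption).

    k = f / U (cyc/m) and S(k) = U * S(f), so that S(k) dk = S(f) df.
    """
    wavenumbers = [f / mean_wind for f in freqs_hz]
    psd_per_cyc_m = [s * mean_wind for s in psd_per_hz]
    return wavenumbers, psd_per_cyc_m

# Hypothetical, evenly spaced frequency spectrum (not measured data).
freqs = [1e-4, 2e-4, 3e-4]   # Hz
psd_f = [10.0, 5.0, 2.0]     # (m/s)^2 per Hz
U = 20.0                     # assumed mean wind (m/s)

ks, psd_k = taylor_transform(freqs, psd_f, U)

# Variance check: integrate both spectra with their own bin widths.
var_f = sum(psd_f) * (freqs[1] - freqs[0])
var_k = sum(psd_k) * (ks[1] - ks[0])
print(math.isclose(var_f, var_k))
```

The transformation only rescales the axes, so a power-law frequency spectrum keeps the same slope when expressed in wavenumber.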

Fritts et al. [1989] suggested that internal gravity waves were responsible for their
horizontal wavenumber spectra. Their spectra were calculated from atmospheric density
variations measured during seven re-entries of NASA space shuttles in the altitude range from 60
to 90 km. The spectra exhibited a slope of approximately -2 at horizontal scales from about 10 to
1000 km.

The horizontal wavenumber spectra inferred from the airborne Na lidar data appear to be
the result of internal gravity waves instead of turbulence for the following reasons. First, the
horizontal and vertical variations of the Na layer were apparently influenced by waves. For
example, during the eastward flight the variations of the centroid height and layer shape were
significantly influenced by waves. Second, the horizontal wavenumber spectra computed from
the lidar data exhibited distinct spectral peaks which appear to be characteristics of waves.
Interestingly, the lidar observation of the wave with a period of 6.1 hours on the eastward flight
seems to agree with radar observations. During the period from November 4 to 20, 1986, the
data collected with the ST radar in Platteville, Colorado showed strong 6-hour variations in
winds in the altitude range of the Na layer (see Section 4.3). Although only the zonal parameters
of the 6.1-hour period wave were explored with the airborne Na lidar, the estimated wave
parameters appear to be the intrinsic parameters. This wave seems responsible for the 6-hour
period wind variations observed with the Platteville radar, and is probably tidally driven because
of its period and westward phase propagation. Finally, Doppler-shifting effects which depended
on flight directions were clearly apparent in the horizontal wavenumber spectra. As a
consequence, the spectra changed significantly in amplitude and shape when flight directions
changed. Some spectral peaks were Doppler shifted to lower wavenumbers, while others were
shifted to higher wavenumbers.

From ground-based Na lidar observations of quasi-monochromatic gravity waves
obtained at Urbana, Illinois, Gardner and Voelz [1987] estimated the slope of the wave kinetic
energy distribution to be -2.95 for vertical scales from 2 km to 17 km, and -1.05 for
horizontal scales from 20 km to 3000 km. These kinetic energy distribution slopes are
comparable  to  the  slopes  of  the  vertical  and  horizontal  wavenumber  spectra  calculated  in  this 
paper  from  the  airborne  Na  lidar  data.  The  average  slope  of  the  vertical  wavenumber  spectra  of 
the  airborne  lidar  data  was  -2.67  at  scales  from  2  km  to  10  km,  and  the  average  slope  of  the 
horizontal  wavenumber  spectra  was  -1.25  at  scales  from  70  km  to  700  km. 

The slope of the horizontal wavenumber spectra calculated from the airborne lidar data
appears to be shallower than the slopes of the GASP [Gage and Nastrom, 1985; 1986] and
shuttle re-entry data [Fritts et al., 1989]. The slope of the GASP data was about -3 at horizontal
scales from 700 km to 10000 km and -5/3 from 3 km to 500 km. The spectral slopes of the
airborne lidar data do appear to increase significantly for horizontal scales greater than 700 km.
However, because the distances of the flight legs varied from 1062 km to 1933 km, the spectral
amplitudes at scales larger than 700 km should be viewed with caution. The differences between
the GASP and airborne lidar data may stem from the difference in the altitude range of the two
data sets. The altitude range of the airborne lidar data was from 82 to 101 km, and that of the
GASP data was from 9 to 14 km. The horizontal spectral slopes of the airborne lidar data are also
shallower than the slope of about -2 estimated from the shuttle re-entry data.

The slopes of the vertical wavenumber spectra calculated from the airborne lidar data
appear to be comparable to the slopes obtained from radar observations. For example, Smith et
al. [1985] calculated spectral slopes of -2.0 to -2.8 from data collected with an MST radar at
Poker Flat, Alaska in the altitude range from 82 to 88 km. The amplitudes of the vertical
wavenumber spectra of the airborne lidar data also agree within a factor of 3 with those of the
spectra observed with radar systems [Tsuda et al., 1988; Vincent, 1984].

The horizontal velocity variances computed from the airborne lidar data also appear to be
comparable to those calculated from the ground-based lidar data and the radar data, but larger
than those calculated from the shuttle re-entry data. The horizontal velocity variances computed
from the airborne lidar data ranged from 310 to 1800 m^2 s^-2 with a mean of 1100 m^2 s^-2. These
variances are comparable to those computed from the ground-based lidar observations at Mauna
Kea Observatory, Hawaii, at Broomfield, Colorado, at Urbana, Illinois, and at the Goddard
Space Flight Center, Maryland. The variances of the radar data collected in Alaska ranged from
650 to 1300 m^2 s^-2 [Balsley and Carter, 1982; Balsley and Garello, 1985]. The variances of the
shuttle re-entry data collected over the Pacific Ocean ranged from 55 to 415 m^2 s^-2 [Fritts et al.,
1989].
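Because the earlier figures report rms wind velocities rather than variances, it can help to convert between the two. The sketch below takes the square root of the variance bounds quoted above for each data set. The grouping labels are chosen for this example.

```python
import math

# Horizontal velocity variance ranges (m^2 s^-2) quoted in the text.
variance_ranges = {
    "airborne lidar":   (310.0, 1800.0),
    "Alaska radar":     (650.0, 1300.0),
    "shuttle re-entry": (55.0, 415.0),
}

# rms velocity is the square root of the variance.
rms_ranges = {name: (math.sqrt(lo), math.sqrt(hi))
              for name, (lo, hi) in variance_ranges.items()}

for name, (lo, hi) in rms_ranges.items():
    print(f"{name:17s} {lo:5.1f} to {hi:5.1f} m/s")

airborne_mean_rms = math.sqrt(1100.0)  # from the quoted mean variance
print(f"airborne mean     {airborne_mean_rms:5.1f} m/s")
```

Expressed this way, the airborne values span roughly 18 to 42 m s^-1, while the shuttle re-entry values stay below about 21 m s^-1.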




3.6  Summary 

The kinetic energy horizontal and vertical wavenumber spectra of horizontal winds are
inferred from the airborne lidar measurements of Na density profiles. The airborne data were
collected during two flights in November 1986. The two flights included one roundtrip from
Denver, Colorado to Springfield, Illinois and another from Denver to the Pacific Coast. During
these flights, the data were collected during the 11.2 hours of flights over baselines totaling
6000 km. The average slope of the horizontal wavenumber spectra was -1.25 at horizontal
scales ranging from 70 to 700 km, and the average slope of the vertical wavenumber spectra was
-2.67 at vertical scales from 2 to 10 km. The altitude range of the measurements was
approximately 82 to 101 km. The slopes of the horizontal wavenumber spectra computed only
for the bottomside (82-90 km) of the Na layer were consistently steeper than those computed for
the topside (90-101 km). The average slope of the bottomside horizontal wavenumber spectra
was -1.51, and that of the topside spectra was -0.97.

Internal gravity waves appear to be responsible for major features of the airborne Na lidar
data. The observed features include the systematic horizontal and vertical variations of the Na
density profiles, the horizontal variations of the centroid height, the presence of distinct spectral
peaks in the horizontal wavenumber spectra, and Doppler-shifting of these spectral peaks. It is
difficult to interpret these observed features in terms of turbulence. However, it seems possible
that the dominant density perturbations observed in the lidar data at horizontal scales in the range
from 20 to 2000 km are due to gravity waves.

The slopes of the vertical wavenumber spectra agree well with the slopes calculated from
radar observations. The slopes of the horizontal wavenumber spectra of the airborne lidar data
are shallower than the slopes of the GASP and shuttle re-entry spectra. In general, the rms
horizontal wind velocities increased with time and with longitude from the Pacific Coast to the
Great Plains. The rms horizontal wind velocities inferred from the airborne data are comparable
with those inferred from ground-based lidar data obtained in Hawaii, Colorado, Illinois, and
Maryland. This appears to indicate that the rms horizontal wind velocities measured over the
Pacific Ocean are comparable to those measured over the land masses.




4.  CORRELATIVE  RADAR  AND  AIRBORNE  SODIUM  LIDAR 
OBSERVATIONS  OF  THE  VERTICAL  AND  HORIZONTAL 
STRUCTURE  OF  GRAVITY  WAVES  AND  TIDES  NEAR 
THE  MESOPAUSE 


4.1  Introduction 

For the past several decades, radar techniques have made important contributions to the
understanding of the dynamics of the mesopause region. In recent years, Na lidar techniques
have matured to the extent that lidar studies of wave dynamics are now complementing radar
observations. Vincent and Fritts [1987] recently reported a radar-based climatology study of
gravity waves measured at Adelaide, Australia. Gardner and Voelz [1987] also reported
extensive lidar-based observations of monochromatic gravity waves measured at Urbana,
Illinois. Studies of atmospheric tides using Na lidars have been reported by Batista et al. [1985]
and Kwon et al. [1987].

Joint radar and Na lidar observations for exploring the dynamics of the mesopause region
have been rare. The only joint observations were reported by Avery and Tetenbaum [1983].
The measurements were conducted at Urbana, Illinois in January and March, 1980 using the
Urbana meteor radar and Na lidar. The results showed some correlation between the altitude of
maximum Na density and the altitude of the zero wind node of the prevailing wind. The
reconstruction of the neutral atmosphere density perturbation profile from meteor wind
harmonics exhibited small-scale fluctuations similar to those observed in the Na density profiles.

Because most radar and lidar observations have typically been made at fixed elevation
angles, measurements of the horizontal structure of gravity waves have been limited. By using
Doppler radars and dual bistatic radars, the horizontal structure of gravity waves was studied by
Vincent and Reid [1983], Smith and Fritts [1983], and Meek et al. [1985]. By using multiple
spaced lidars and steerable lidars, the horizontal structure of the Na layer was also studied by
Thomas et al. [1977], Clemesha et al. [1981], and Gardner et al. [1982; 1986]. Because of the
increased atmospheric attenuation and longer propagation paths at the lower elevation angles,
steerable lidar measurements are limited to a baseline of about 100 km or less.

To study longer baselines, the UIUC group conducted an airborne lidar experiment with
the support of the NCAR-RAF in Broomfield, Colorado in November 1986. To investigate the
longitudinal characteristics of gravity waves and tides at mid-latitudes, a total of three flights
were made over the Great Plains, Rocky Mountains, and Pacific Coast. The flights were
conducted out of Stapleton International Airport in Denver, Colorado using the NCAR Electra
aircraft. In addition to the airborne observations, the lidar was operated on the ground in
Broomfield and Denver during several nights from November 7 to 20. During the ground-based
and airborne lidar observations, the University of Colorado group operated an ST radar in
Platteville, Colorado. The results of these joint lidar/radar observations are presented in this
chapter.

4.2  Description  of  the  Experiment 

4.2.1  Radar 

From November 4 to 20, 1986, the ST radar located at Platteville, Colorado (40°N,
104°W) was operated by the University of Colorado group. In order to measure wind
perturbations in the mesosphere and lower thermosphere, a meteor echo detection and collection
system was attached to the ST radar and operated. This system is described in detail by Wang et
al. [1988]. Two radar beams were directed at azimuth angles of 0° (north) and 90° (east). The
zenith angles of both beams were 15°, and the operating frequency was 49.8 MHz.
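A beam tilted 15° from the zenith senses mainly the horizontal wind component along its azimuth. A minimal sketch of the conversion follows, assuming the vertical wind contribution to the radial velocity is negligible. The function name and the sample radial velocity are hypothetical and are not taken from the source or from Wang et al. [1988].

```python
import math

ZENITH_DEG = 15.0  # zenith angle of both beams, as stated in the text

def horizontal_from_radial(v_radial, zenith_deg=ZENITH_DEG):
    """Horizontal wind along the beam azimuth (m/s) from a radial velocity (m/s).

    Assumes the vertical wind is negligible, so v_radial = v_horizontal * sin(zenith).
    """
    return v_radial / math.sin(math.radians(zenith_deg))

# Hypothetical radial velocity of 1.0 m/s measured on the east-pointing beam.
zonal_wind = horizontal_from_radial(1.0)
print(f"zonal wind = {zonal_wind:.2f} m/s")
```

With a 15° zenith angle, the radial velocity is scaled by about 3.9 to obtain the horizontal component, so small errors in radial velocity are amplified accordingly.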




4.2.2  Lidar 

In early November of 1986, the UIUC lidar was installed on board the Electra aircraft
operated by the NCAR-RAF in Broomfield, Colorado. The system included a flashlamp-pumped
dye laser, a 35 cm diameter Cassegrain telescope, and associated electronic and optical
subsystems. The major lidar and aircraft parameters are summarized in Table 2.1. Following
the installation, ground-based lidar observations were obtained in Broomfield for a total of 17.5
hours on 4 different nights from November 7 to 20. Additional ground-based observations were
obtained in Denver as pre- and post-flight tests for a total of 2 hours on 3 different nights from
November 13 to 18. The airborne lidar observations are described in detail in Section 3.3.
Table 4.1 summarizes the system performance and observation times for the ground-based and
airborne lidar observations.


4.3 Radar Observations

The vertical profiles of average meridional and zonal background wind velocities
measured from November 4 to 20, 1986 are plotted in Figure 4.1 a and b. The average zonal
background wind velocity was approximately 5 m s^-1 eastward in the altitude range of the Na
layer (80-105 km). The average meridional background wind velocity was small compared to
the zonal background wind velocity at altitudes lower than 96 km. However, the meridional
background wind velocity appears to increase at altitudes above 96 km.

The diurnal variation of the horizontal wind perturbations measured near 90 km is plotted
in Figure 4.2. The boxes represent the hourly velocity measurements, and the solid curve
represents a sinusoidal least-squares fit. The area of the boxes is proportional to the number of
echoes used in computing the hourly average. The period of the dominant oscillation is
approximately 6 hours, and the amplitude of the 6-hour period oscillation is about 1.3 m s^-1.
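A sinusoidal least-squares fit of the kind mentioned above can be sketched as follows. This is a minimal illustration that assumes a fixed 6-hour period and an unweighted fit; the source does not state whether the fit was weighted by echo count. The data are synthetic, not the radar measurements, and all function names are hypothetical.

```python
import math
import random

def solve3(a, b):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    n = 3
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in reversed(range(n)):
        x[r] = (m[r][n] - sum(m[r][c] * x[c] for c in range(r + 1, n))) / m[r][r]
    return x

def fit_sinusoid(t_hours, v, period_hours):
    """Fit v(t) = mean + a*cos(wt) + b*sin(wt); return (mean, amplitude, phase)."""
    w = 2.0 * math.pi / period_hours
    rows = [[1.0, math.cos(w * t), math.sin(w * t)] for t in t_hours]
    # Normal equations: (X^T X) c = X^T v
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xtv = [sum(r[i] * y for r, y in zip(rows, v)) for i in range(3)]
    mean, a, b = solve3(xtx, xtv)
    # a*cos(wt) + b*sin(wt) = amplitude * cos(wt - phase)
    return mean, math.hypot(a, b), math.atan2(b, a)

# Synthetic hourly velocities: a 6-hour oscillation of amplitude 1.3 m/s plus noise.
random.seed(0)
t = [float(h) for h in range(24)]
v_obs = [0.5 + 1.3 * math.cos(2.0 * math.pi * h / 6.0 - 0.4) + random.gauss(0.0, 0.05)
         for h in t]

mean, amplitude, phase = fit_sinusoid(t, v_obs, period_hours=6.0)
print(f"mean = {mean:.2f} m/s, amplitude = {amplitude:.2f} m/s, phase = {phase:.2f} rad")
```

Fixing the period makes the problem linear in the three unknowns, which is why a small normal-equation solve is sufficient here.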

The vertical profiles of the 6-hour period meridional and zonal wind velocities are plotted
in Figure 4.3. Note that the amplitudes of the zonal winds are much larger than the amplitudes of


Table 4.1. Summary of the Airborne and Ground-based Na Lidar Observations

Date             Time (MST)^a   Signal Level^b   Location         Comments
                                (counts/shot)
November 7-8     2259-0331      0.6              Broomfield       Ground-based
November 11      0236-0300      0.8              Broomfield       Ground-based
November 12-13   2019-0043      2                Broomfield       Ground-based
November 13      2015-2033      12               Denver           Ground-based
November 13      2119-2357      10               CO, NM, AZ, WY   Triangular flight
November 15      1943-2055      6                Denver           Ground-based
November 15-16   2210-0309      7                CO-IL            Eastward flight
November 15      0352-0357      4                Denver           Ground-based
November 17      2011-2033      6                Denver           Ground-based
November 17-18   2141-0502      8                CO-Pacific       Westward flight
November 19-20   2228-0625      6                Broomfield       Ground-based

Total observation time
Ground-based: 19 hours 28 min
Airborne: 14 hours 58 min

a MST = UT - 7 hours.
b Typical signal level at Urbana is 5 counts/shot.

Figure 4.1. Vertical profiles of a) the meridional component and b) the zonal component of
average background wind velocity measured with the Platteville ST radar during the
period from November 4 to 20, 1986. The data were provided by the University of
Colorado. (Axes: height in km versus mean wind in m s^-1.)


Figure 4.3. Vertical profiles of a) the meridional component and b) the zonal component of the
6-hour period horizontal winds. The data were provided by the University of
Colorado. (Axes: height in km versus amplitude in m s^-1.)


the meridional winds in the altitude region from 85 to 92 km. This altitude region approximately
corresponds to the bottomside of the Na layer. The largest zonal wind velocity in this altitude
region was 9 m s^-1 at 88 km. Large variations with periods of 6 hours were also observed with
the ground-based Na lidar (Section 4.4) and the airborne Na lidar (Section 4.5), and were
dominant only on the bottomside of the Na layer. These 6-hour period variations appear to be
related to the 6-hour period variations in the horizontal winds measured with the radar.

4.4  Ground-based  Lidar  Observations 

From November 7 to 20, 1986, a total of 19.5 hours of ground-based Na data were
collected in Broomfield and Denver, Colorado with the lidar on board the NCAR Electra. In this
section, the longest data set, obtained on the night of November 19-20 in Broomfield, is
presented. The time sequence of the density profiles measured on this night is plotted in Figure
4.4. The density profiles have been filtered vertically with a cutoff of 3 km and temporally with
a cutoff of 50 min. The profiles have been normalized so that each has the same column
abundance and are plotted on a linear scale at 10 min intervals.
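The normalization step described above can be sketched as follows, assuming the column abundance is the altitude integral of the density profile (approximated here with the trapezoidal rule). The profiles are synthetic Gaussian layers, not the measured data, and the function names are hypothetical.

```python
import math

def column_abundance(altitudes_km, density):
    """Trapezoidal integral of density over altitude."""
    total = 0.0
    for i in range(1, len(altitudes_km)):
        dz = altitudes_km[i] - altitudes_km[i - 1]
        total += 0.5 * (density[i] + density[i - 1]) * dz
    return total

def normalize_profiles(altitudes_km, profiles, target=1.0):
    """Scale each profile so that its column abundance equals `target`."""
    normalized = []
    for profile in profiles:
        scale = target / column_abundance(altitudes_km, profile)
        normalized.append([d * scale for d in profile])
    return normalized

# Synthetic layers spanning 80 to 105 km in 0.5 km steps.
altitudes = [80.0 + 0.5 * i for i in range(51)]

def gaussian_layer(peak, centroid_km=92.0, width_km=4.0):
    return [peak * math.exp(-0.5 * ((z - centroid_km) / width_km) ** 2) for z in altitudes]

profiles = [gaussian_layer(3000.0), gaussian_layer(4500.0, centroid_km=91.0)]
normalized = normalize_profiles(altitudes, profiles)
abundances = [column_abundance(altitudes, p) for p in normalized]
print(abundances)
```

After normalization, differences between successive profiles reflect changes in layer shape rather than changes in total Na content.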

The temporal variations of the layer centroid height, rms width, and column abundance
are plotted in Figure 4.5. All three parameters show large variations with a period of
approximately 6 hours. For example, the local maxima of the centroid height occurred at 2300
and 0452 MST. The time difference between these maxima was 5.9 hours. The periods of the
dominant rms width and column abundance variations were approximately 6.6 hours and 7.0
hours, respectively. All three parameters also exhibit shorter temporal variations from 0000 to
0230 MST with a period of approximately 2 hours. For example, the local minima of the
centroid height occurred at 0012 and 0155 MST for a time difference of 1.7 hours. The periods
of the shorter temporal variations in the rms width and column abundance were 1.9 and 1.8
hours, respectively.
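The periods quoted above follow directly from the clock times of successive extrema. A small sketch of that arithmetic, allowing for one midnight crossing, is given below. The helper name is hypothetical, and the times of the centroid minima are read here as 0012 and 0155 MST.

```python
def hours_between(start_hhmm, end_hhmm):
    """Elapsed hours between two 'HHMM' clock times, allowing one midnight crossing."""
    def to_minutes(hhmm):
        return int(hhmm[:2]) * 60 + int(hhmm[2:])
    delta = (to_minutes(end_hhmm) - to_minutes(start_hhmm)) % (24 * 60)
    return delta / 60.0

long_period = hours_between("2300", "0452")   # centroid-height maxima
short_period = hours_between("0012", "0155")  # centroid-height minima

print(f"long period  = {long_period:.1f} h")
print(f"short period = {short_period:.1f} h")
```

Both values round to the figures given in the text (5.9 and 1.7 hours).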


Figure 4.4. Sodium density profiles measured on the night of November 19-20, 1986. The
profiles have been filtered vertically with a cutoff of 3 km and temporally with a
cutoff of 50 min. The profiles have been normalized so that each has the same
column abundance, and are plotted on a linear scale at 10 min intervals.
(Horizontal axis: Mountain Standard Time.)


Figure 4.5. Temporal variations of the layer centroid height, rms width, and column abundance
measured on November 19-20, 1986.


The shorter temporal variations were more dominant on the bottomside of the Na layer.
The temporal frequency spectrum of the relative sodium density perturbations for the whole layer
is plotted in Figure 4.6 a, and the temporal frequency spectra computed for the bottomside and
the topside of the layer are plotted in Figure 4.6 b. The technique for computing the temporal
spectra is described by Gardner and Senft [1989]. The Na density profiles were vertically
filtered with a cutoff of 1 km before the spectra were calculated. The altitude range for the
bottomside was 82 to 90 km, and that for the topside was 90 to 100 km. The straight lines in
Figure 4.6 are linear regression fits which were used to estimate the spectral slopes over temporal
scales from 0.5 to 6 hours. The spectral slopes were approximately -1.52 for the whole layer
spectrum, -1.77 for the bottomside spectrum, and -1.12 for the topside spectrum. The topside
spectral amplitudes are larger, which indicates that the wave amplitudes were growing with
altitude. The shallower slope of the topside spectrum may be the result of saturation affecting
longer period waves. Note the dominant spectral peak in the bottomside spectrum near 1.5 x 10^-4
s^-1. The period is calculated to be 1.9 hours, which is quite comparable to the periods of the
shorter temporal variations in the centroid height, rms width, and column abundance. The
topside spectrum does not exhibit a spectral peak at the same period.
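The slope estimate and the peak-period conversion described above can be illustrated with a short sketch. It assumes an ordinary least-squares line fitted to log10(power) versus log10(frequency), which is one common way to estimate a spectral slope and is not necessarily the exact procedure of Gardner and Senft [1989]. The spectrum is a synthetic power law, not the lidar data, and the peak frequency is read as 1.5 x 10^-4 s^-1.

```python
import math

def loglog_slope(freqs, power):
    """Slope of a least-squares line fitted to log10(power) versus log10(freq)."""
    x = [math.log10(f) for f in freqs]
    y = [math.log10(p) for p in power]
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    return sxy / sxx

# Synthetic power-law spectrum sampled at periods from 6 h down to 0.5 h.
periods_h = [6.0, 4.0, 3.0, 2.0, 1.5, 1.0, 0.75, 0.5]
freqs = [1.0 / (p * 3600.0) for p in periods_h]   # cycles per second
power = [f ** -1.52 for f in freqs]
slope = loglog_slope(freqs, power)

# Period corresponding to the bottomside spectral peak frequency.
peak_period_h = 1.0 / 1.5e-4 / 3600.0

print(f"slope = {slope:.2f}")
print(f"peak period = {peak_period_h:.2f} h")
```

The peak frequency corresponds to a period of about 1.85 hours, which rounds to the 1.9 hours quoted in the text.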

The shorter period oscillations are also seen clearly in temporal variations of Na density.
Figure 4.7 shows the relative temporal variations from 81 to 92 km. The Na density profiles
were first filtered vertically with a cutoff of 5 km and temporally with a cutoff of 90 min. The
density variations were then computed and normalized so that the peak-to-peak amplitudes at
each altitude were equal. The vertical phase progression of the shorter period variations is most
clearly seen between 2230 and 0230 MST in the altitude range from 81 to 89 km. At higher
altitudes, the variations with a period near 2 hours are not in evidence. The diagonal lines
indicate the apparent phase progressions from which the vertical phase velocity is estimated to be
1.8 m s^-1. This velocity corresponds to a vertical wavelength of 12 km for a wave with the


Figure 4.6 a). Temporal frequency spectrum computed for the whole layer from data collected
on November 19-20, 1986. The Na density profiles were vertically filtered with
a cutoff of 1 km before the spectra were computed. The straight line is a linear
regression fit which was used to estimate the spectral slopes over temporal scales
from 30 to 360 min. (Axes: horizontal velocity spectrum and atmospheric density
spectrum versus frequency in cyc/s.)


Figure 4.6 b). Temporal frequency spectra computed for the layer bottomside and topside from
data collected on November 19-20, 1986. The Na density profiles were vertically
filtered with a cutoff of 1 km before the spectra were computed. The straight
lines are linear regression fits which were used to estimate the spectral slopes over
temporal scales from 30 to 360 min. (Axes: horizontal velocity spectrum and
atmospheric density spectrum versus frequency in cyc/s.)


Figure 4.7. Relative temporal variations of the Na density from 81 to 92 km measured on
November 19-20, 1986. The Na density profiles were vertically filtered with a
cutoff of 5 km and temporally with a cutoff of 90 min. The diagonal lines indicate
apparent vertical phase progressions. The estimated phase velocity is 1.8 m s^-1.
(Axes: altitude in km versus Mountain Standard Time.)


period of 1.9 hours. Because this wave may be Doppler shifted by the background winds, the
intrinsic period and vertical wavelength may be different from these values.
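The vertical wavelength quoted for the 2-hour wave is simply the product of the apparent vertical phase velocity and the observed period. The sketch below reproduces that arithmetic, using the 1.8 m s^-1 phase velocity given in the caption of Figure 4.7; the function name is hypothetical.

```python
def wavelength_from_phase_velocity_km(phase_velocity_m_s, period_hours):
    """Vertical wavelength (km) = vertical phase velocity (m/s) x period (s) / 1000."""
    return phase_velocity_m_s * period_hours * 3600.0 / 1000.0

wavelength_2h = wavelength_from_phase_velocity_km(1.8, 1.9)
print(f"vertical wavelength = {wavelength_2h:.1f} km")
```

The result is about 12.3 km, consistent with the 12 km stated in the text. Both inputs are apparent (ground-frame) quantities, so Doppler shifting by the background wind is not accounted for.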

The vertical wind velocity on the layer bottomside at 82 km is plotted versus time in
Figure 4.8. The technique for estimating the vertical wind velocity is described in detail by
Kwon et al. [1987]. To compute the vertical winds, the Na density profiles were filtered
vertically with a cutoff of 5 km and temporally with a cutoff of 60 min. The period of the most
dominant variation was 1.9 hours, and the amplitude of this variation decreased significantly
with time toward the early morning of November 20. The vertical wind velocity on the layer
topside at 101 km is plotted in Figure 4.9. Although a variation with a period of approximately
2 hours is present at this altitude, the amplitude of the 2-hour period oscillation is much smaller
than at 82 km.

The average vertical wavenumber power spectrum of the Na density profiles obtained
from 2300 to 0100 MST is plotted in Figure 4.10. This time interval corresponds to the period
during which the amplitude of the 2-hour period wave was large. The dashed line in the
spectrum indicates the estimated shot noise level. The technique for estimating vertical
wavelengths of gravity waves from the Na profile vertical spectra is described in detail by
Gardner and Voelz [1987]. The wavelengths of dominant waves in the vertical power spectrum
in Figure 4.10 appear to be 6.4 km and 3.1 km. These wavelengths may represent harmonics of
the vertical wavelength of the 2-hour period wave, which is about 12 km.

The rms horizontal wind velocity computed for the entire altitude range of the Na layer is
plotted versus time in Figure 4.11. The technique for computing the rms horizontal winds is
described by Kwon et al. [1989a]. The Na density profiles were initially filtered vertically with a
cutoff of 1 km and temporally with a cutoff of 20 min. The wind amplitudes exhibit strong
2-hour period oscillations from 2300 to 0200 MST, and agree qualitatively with the temporal
variations of the vertical winds estimated at the altitude of 82 km (Figure 4.8). The amplitude of
the horizontal winds for the 2-hour period oscillations is about 7 m s^-1 near midnight. The


Figure 4.8. The vertical wind velocity estimated at the altitude of 82 km. The Na density
profiles were initially filtered vertically with a cutoff of 5 km and temporally with a
cutoff of 60 min. (Axes: vertical wind in cm/sec versus Mountain Standard Time,
November 19-20, 1986.)


Figure 4.9. The vertical wind velocity estimated at the altitude of 101 km. The Na density
profiles were initially filtered vertically with a cutoff of 5 km and temporally with a
cutoff of 60 min. (Axes: vertical wind in cm/sec versus Mountain Standard Time,
November 19-20, 1986.)


Figure 4.10. The average vertical wavenumber power spectrum computed for the Na density
profiles collected from 2300 to 0100 MST on November 19-20, 1986.
(Vertical axis: vertical power spectrum in dB.)


Figure 4.11. The rms horizontal wind velocity inferred from the Na density profiles collected on
November 19-20, 1986. The profiles were initially filtered vertically with a cutoff
of 1 km and temporally with a cutoff of 20 min. (Axes: rms horizontal wind velocity
in m s^-1 and atmospheric density perturbations in % versus Mountain Standard Time.)


horizontal wind amplitude of the 2-hour period wave is related to the amplitude of the spectral
peak at the period of about 120 min in the temporal frequency spectrum (Figure 4.6 a) by

[Equation (4.1) is illegible in the scanned original; the surviving fragments include
$\cosh(\beta \Delta z)$ and $\sinh(\beta \Delta z)$ terms.]                                (4.1)

where $E_x(\omega_0)$ = spectral amplitude at frequency $\omega_0$ (m^2 s^-1),
$\Delta T$ = observation period (s),
$V_x(z_0)$ = horizontal wind amplitude at the altitude of the centroid height of the Na
layer (m s^-1),
$\beta$ = amplitude growth factor (m^-1),
$\Delta z$ = vertical extent of the Na layer (m),
$\sigma_0$ = rms width of the Na layer (m),
$\gamma$ = ratio of specific heats (≈ 1.4), and
$H$ = atmospheric scale height (m).

By assuming that $\beta \Delta z < 1$ for the 2-hour period wave, the horizontal wind amplitude
which corresponds to the 2-hour spectral peak in Figure 4.6 a is calculated to be approximately 7
m s^-1. This velocity is for the horizontal winds averaged over the total observation period on
November 19-20, and compares favorably with the velocity of 7 m s^-1 estimated from the rms
horizontal winds plotted in Figure 4.11.

For an atmospheric wind field generated by a monochromatic wave, Hines [1960]
showed that the horizontal wind amplitude is related to the vertical wind amplitude by

$$\frac{V_x}{V_z} \approx \frac{T_{in}}{T_B} \qquad (4.2)$$

where $V_z$ = vertical wind amplitude,
$T_{in}$ = intrinsic wave period, and
$T_B$ = Brunt-Vaisala period (= 5 min).




When the 2-hour period wave was strong near midnight, the amplitude of the horizontal
winds was 7 m s^-1 and that of the vertical winds was 0.3 m s^-1, which is the average of the
amplitudes estimated at 82 km (Figure 4.8) and at 101 km (Figure 4.9). Thus $V_x/V_z$ = 23
compares favorably with the ratio $T_{in}/T_B \approx 24$ for the 2-hour period wave.
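The comparison above can be reproduced numerically. The sketch assumes the low-frequency approximation in which the ratio of horizontal to vertical wind amplitudes is compared directly with the ratio of the wave period to the Brunt-Vaisala period; the variable names are illustrative.

```python
# Amplitudes quoted in the text for the 2-hour period wave near midnight.
v_horizontal = 7.0   # m/s
v_vertical = 0.3     # m/s

# Periods quoted in the text.
wave_period_min = 120.0   # 2-hour period wave
bv_period_min = 5.0       # Brunt-Vaisala period

wind_ratio = v_horizontal / v_vertical
period_ratio = wave_period_min / bv_period_min

print(f"V_x / V_z    = {wind_ratio:.1f}")
print(f"T_in / T_B   = {period_ratio:.1f}")
```

The two ratios, about 23 and 24, differ by only a few percent, which is well within the uncertainty of the 0.3 m s^-1 vertical wind estimate.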

Dominant waves with a period of 2 hours have been observed often at Urbana, Illinois
[Gardner et al., 1986; Gardner and Voelz, 1987; Gardner, 1989]. For example, the density
profiles obtained during the early morning of June 12, 1984 [Gardner et al., 1986] and June 28,
1988 [Gardner, 1989] exhibited large vertical displacements of the bottomside of the layer with a
period of about 2 hours. During these two nights, the 2-hour period oscillation was dominant
only on the bottomside of the layer. However, the vertical wavelength of 12 km measured on
November 19-20 is larger than the vertical wavelengths of 2-hour period gravity waves which
have been frequently observed at Urbana. The vertical wavelengths for the majority of the
2-hour period gravity waves ranged from 4.4 to 7.3 km [Gardner and Voelz, 1987].

The 6-hour period variations observed in the centroid height, rms width, and column
abundance are also seen in the vertical variations of local Na density maxima plotted in Figure
4.12. The Na density profiles were initially filtered vertically with a cutoff of 3 km and then
temporally with a cutoff of 50 min. There appear to be two maxima with dominant downward
motions. Linear regression fits were used to estimate the vertical velocities. The upper density
maximum moved downward at a velocity of 19.1 cm s^-1, and the lower density maximum
moved at 18.3 cm s^-1. The average vertical velocity was 18.7 cm s^-1. The time separation
between these maxima was 5.6 hours. The measured velocity corresponds to a vertical
wavelength of about 4 km for a wave with a period of 6 hours. This vertical wavelength
appears to be far too small to induce the large 6-hour period variations in the centroid height, rms
width, and column abundance in the Na layer. Gardner and Shelton [1985] derived theoretical
expressions for the layer parameter variations in terms of the wave period, vertical wavelength,
and amplitude. Waves with vertical wavelengths less than 5 km have a negligible effect on the


Figure 4.12. Temporal variations of the altitudes of local Na density maxima measured on
November 19-20, 1986. The Na density profiles were initially filtered vertically
with a cutoff of 3 km and temporally with a cutoff of 50 min.
(Horizontal axis: Mountain Standard Time.)


layer centroid height, rms width, and column abundance. Appreciable perturbations in these
parameters occur only for waves with vertical wavelengths exceeding about 10 km. Therefore,
the vertical wavelength of the 6-hour period wave is believed to be much larger than 3.8 km.
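The apparent vertical wavelength discussed above follows from the mean descent rate of the density maxima multiplied by a period. The sketch below evaluates it for both the nominal 6-hour period and the measured 5.6-hour separation. Associating the 3.8 km figure with the 5.6-hour separation is an inference from the numbers, not something the source states.

```python
# Descent rates of the two density maxima quoted in the text (cm/s).
upper_speed_cm_s = 19.1
lower_speed_cm_s = 18.3
mean_speed_cm_s = (upper_speed_cm_s + lower_speed_cm_s) / 2.0

def apparent_wavelength_km(speed_cm_s, period_hours):
    """Apparent vertical wavelength (km) = descent speed x period."""
    return (speed_cm_s / 100.0) * period_hours * 3600.0 / 1000.0

wavelength_6h = apparent_wavelength_km(mean_speed_cm_s, 6.0)
wavelength_5p6h = apparent_wavelength_km(mean_speed_cm_s, 5.6)

print(f"mean descent rate = {mean_speed_cm_s:.1f} cm/s")
print(f"wavelength for 6.0 h = {wavelength_6h:.2f} km")
print(f"wavelength for 5.6 h = {wavelength_5p6h:.2f} km")
```

The two results, about 4.0 km and 3.8 km, match the "about 4 km" and "3.8 km" values that appear in the text.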

To estimate the vertical wavelength of the 6-hour period wave accurately, the average
vertical power spectrum for the Na density profiles obtained from 0300 to 0625 MST is
computed and plotted in Figure 4.13. The vertical wavelength of a dominant wave in the
spectrum is approximately 7.3 km, which may be the vertical wavelength of the 6-hour period
wave. Although this vertical wavelength is larger than that estimated from the apparent vertical
progressions of the Na density maxima, it is comparable to the vertical wavelength of the 6-hour
period waves observed at Urbana, which ranged from 8.1 to 10.2 km [Gardner and Voelz,
1987]. It is also closer to the value estimated from the airborne observations [Kwon et al.,
1989a].

Gardner and Voelz [1987] described the technique for estimating the amplitude of the
wave-induced atmospheric density perturbations from the Na profile vertical power spectra.
From Figure 4.13, the amplitude of the atmospheric density perturbations due to the 6-hour
period wave is computed to be 3.9 %. Gardner and Voelz [1987] also showed that this
amplitude is related to the horizontal wind amplitude by the following equation.


where $g$ = gravitational acceleration (= 9.5 m s^-2), and
$Ae^{\beta z}$ = amplitude of the atmospheric density perturbations due to the wave.

The horizontal wind amplitude for the 6-hour period wave is computed to be 17 m s^-1.
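As a rough plausibility check on the 17 m s^-1 figure, the sketch below assumes the standard low-frequency gravity-wave polarization relation, in which the horizontal wind amplitude is approximately (g/N) times the relative density perturbation amplitude, with N = 2π/T_B and T_B = 5 min. This assumed form may differ from the expression of Gardner and Voelz [1987] used in the text, and the variable names are illustrative.

```python
import math

g = 9.5                      # gravitational acceleration (m s^-2), from the text
bv_period_s = 5.0 * 60.0     # Brunt-Vaisala period of 5 min, from the text
density_amplitude = 0.039    # 3.9 % relative density perturbation, from the text

# Brunt-Vaisala angular frequency (rad/s).
bv_frequency = 2.0 * math.pi / bv_period_s

# Assumed polarization relation: V_x ~ (g / N) * (relative density amplitude).
horizontal_wind = g / bv_frequency * density_amplitude

print(f"horizontal wind amplitude = {horizontal_wind:.1f} m/s")
```

Under these assumptions the estimate is close to 18 m s^-1, in reasonable agreement with the 17 m s^-1 quoted; the small difference could reflect a slightly different Brunt-Vaisala frequency or additional factors in the original equation.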

The  average  vertical  wavenumber  spectrum  corresponding  to  all  the  density  profiles 
obtained  on  November  19-20  is  plotted  in  Figure  4.14.  The  technique  for  estimating  the  vertical 
wavenumber  spectrum  is  described  by  Kwon  et  al.  [1989a].  The  Na  density  profiles  were  first 
filtered  temporally  with  a  cutoff  of  20  min,  and  then  the  spectrum  was  calculated.  The  shot  noise 


Figure 4.14. Vertical wavenumber spectrum measured on November 19-20, 1986. The Na
density profiles were initially filtered temporally with a cutoff of 20 min. The
straight line is a linear regression fit which was used to estimate the spectral slope
over vertical scales ranging from 2 to 10 km. (Axes: horizontal velocity spectrum and
atmospheric density spectrum versus vertical wavenumber in cyc/m.)


level was estimated and subtracted from the spectrum. The results are plotted in Figure 4.14.
The straight line is a linear regression fit which was used to estimate the spectral slope over the
vertical wavelength range from 2 to 10 km. The slope is estimated to be about -1.16.

4.5  Airborne  Lidar  Observations 

In this section, the data collected during the eastward flight of November 15-16, 1986
are presented. As discussed earlier in Section 3.4, the Na layer was dominated by two
quasi-monochromatic waves during the flight. These two waves appear to have induced the westward
propagating centroid maxima and minima plotted in Figure 3.10. The longitudinal variations of
the rms width and column abundance observed during the flight also exhibited the westward
propagating maxima and minima, as can be seen in Figure 4.15. The parameters of the two
waves are summarized in Table 3.2. The intrinsic periods of these two waves were about 6
hours and 2 hours, which are surprisingly similar to those of the two dominant waves observed
with the ground-based lidar at Broomfield, Colorado on November 19-20. The two waves
observed during the flight were also dominant only on the bottomside of the layer. The zonal
wavelength of the 6-hour period wave observed during the eastward flight was related to the
horizontal separation distance between two centroid maxima measured on the eastbound leg near
99.5°W and 92.2°W in Figure 3.10, and also the distance between two maxima measured on the
westbound leg near 104.6°W and 93.2°W. These centroid maxima were propagating westward
at an apparent velocity of about 30 m s^-1. The centroid maximum near 104.6°W measured on the
westbound leg would have propagated to Broomfield, Colorado (105°W) near 0300 MST on
November 16. The centroid height observed at Broomfield on November 19-20 reached a
maximum near 0500 MST (Figure 4.5), which may represent a time shift of 2 hours for the
6-hour period wave over the time span of 4 days from November 16 to 20 when the two
observations were made.
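The longitude separations quoted above can be turned into approximate east-west distances and travel times. The sketch below assumes a spherical Earth, a flight track near 40°N, and ignores the roughly 90 km altitude of the layer; these are assumptions made for illustration and are not stated in the source. The distances are apparent separations as seen from the moving aircraft, so they are not themselves the zonal wavelength.

```python
import math

EARTH_RADIUS_KM = 6371.0   # mean Earth radius (layer altitude ignored)
LATITUDE_DEG = 40.0        # assumed latitude of the flight track

def zonal_distance_km(lon_a_deg_w, lon_b_deg_w, lat_deg=LATITUDE_DEG):
    """East-west distance along a latitude circle between two longitudes."""
    dlon = math.radians(abs(lon_a_deg_w - lon_b_deg_w))
    return EARTH_RADIUS_KM * math.cos(math.radians(lat_deg)) * dlon

# Apparent separations of the centroid maxima on each flight leg.
eastbound_separation_km = zonal_distance_km(99.5, 92.2)
westbound_separation_km = zonal_distance_km(104.6, 93.2)

# Time for a feature moving westward at 30 m/s to travel from 104.6 W to 105 W.
travel_minutes = zonal_distance_km(104.6, 105.0) * 1000.0 / 30.0 / 60.0

print(f"eastbound separation = {eastbound_separation_km:.0f} km")
print(f"westbound separation = {westbound_separation_km:.0f} km")
print(f"travel time to 105 W = {travel_minutes:.0f} min")
```

Under these assumptions, a maximum at 104.6°W would reach 105°W in roughly 20 minutes at 30 m s^-1.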


Figure 4.15 a). Longitudinal variations of the rms width of the Na layer observed during the
eastward flight on November 15-16, 1986. (Axes: rms width in km versus
longitude in °W.)


Figure 4.15 b). Longitudinal variations of the column abundance of the Na layer observed
during the eastward flight on November 15-16, 1986. (Axes: column abundance
versus longitude in °W, from 106 to 90.)


The  average  vertical  wavenumber  power  spectrum  computed  for  the  Na  density  profiles 
obtained during the westbound flight leg in the longitude range from 96°W to 102°W is plotted in
Figure  4.16.  The  vertical  wavelength  of  a  dominant  wave  in  the  spectrum  is  7.3  km,  and  is 
believed  to  be  the  vertical  wavelength  of  the  6-hour  period  wave.  This  wavelength  is  the  same  as 
that  estimated  for  the  6-hour  period  wave  observed  with  the  ground-based  lidar  on  November 
19-20. 

The  vertical  wavelength  of  the  2-hour  period  wave  observed  with  the  ground-based  lidar 
was  estimated  to  be  12  km.  The  vertical  wavelength  of  the  2-hour  period  wave  observed  with 
the  airborne  lidar  is  related  to  the  intrinsic  horizontal  wavelength  by  the  gravity  wave  dispersion 
relation. 


where λz = intrinsic vertical wavelength, and
λx = intrinsic horizontal wavelength.

The  intrinsic  horizontal  wavelength  of  the  2-hour  period  wave  was  not  measured  during  the 
flight. However, by using the intrinsic zonal wavelength of 263 km and the intrinsic period of
102  min,  the  vertical  wavelength  is  calculated  to  be  12.9  km,  which  is  quite  comparable  to  the 
vertical  wavelength  of  12  km  for  the  2-hour  period  wave  observed  with  the  ground-based  lidar 
on  November  19-20.  This  appears  to  indicate  that  the  intrinsic  zonal  wavelength  measured 
during  the  flight  is  comparable  to  the  intrinsic  horizontal  wavelength,  and  consequently  the  wave 
was  propagating  almost  zonally  (westward). 
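
As a rough check of the 12.9 km value, the sketch below evaluates two common forms of the
gravity wave dispersion relation: the mid-frequency approximation λz ≈ λx T_B / T, and the
fuller form λz = λx / sqrt((T/T_B)² - 1). The Brunt-Vaisala period T_B = 5 min and the
function names are assumptions made for this illustration; the exact form of the relation
used in the text is not reproduced above, so this is only a plausibility check.

```python
import math

def vertical_wavelength_approx_km(lambda_x_km, period_min, brunt_period_min=5.0):
    """Mid-frequency approximation: lambda_z = lambda_x * T_B / T.

    brunt_period_min is an assumed Brunt-Vaisala period, not a value from the text.
    """
    return lambda_x_km * brunt_period_min / period_min

def vertical_wavelength_full_km(lambda_x_km, period_min, brunt_period_min=5.0):
    """Form without the mid-frequency approximation (rotation and scale height neglected)."""
    ratio = period_min / brunt_period_min
    return lambda_x_km / math.sqrt(ratio ** 2 - 1.0)

# Values quoted in the text: zonal wavelength 263 km, intrinsic period 102 min.
approx = vertical_wavelength_approx_km(263.0, 102.0)
full = vertical_wavelength_full_km(263.0, 102.0)
print(f"approximate form: {approx:.1f} km, full form: {full:.1f} km")
```

Under the assumed T_B, both forms round to 12.9 km, so the quoted figure does not depend on
which of the two is used.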

4.6  Summary 

The results of the joint lidar/radar campaign conducted in November 1986 are presented.
The lidar observations were obtained with the airborne Na lidar and with the ground-based lidar
at Broomfield and Denver, Colorado. The airborne lidar data presented in this chapter were
collected during the eastward flight on the night of November 15, 1986, and the ground-based
lidar data were obtained for a total of 8 hours on the night of November 19, 1986. The radar
observations were obtained with the ST radar at Platteville, Colorado during the period from
November 4 to 20, 1986.

Figure 4.16. The average vertical wavenumber power spectrum computed for the Na density
profiles collected during the westbound leg of the eastward flight in the longitude
range from 96°W to 102°W.

The  ground-based  lidar  observations  showed  very  strong  density  perturbations  in  the  Na 
layer  with  a  period  of  approximately  6  hours.  The  airborne  observations  of  the  Na  layer  also 
exhibited  a  dominant  variation  by  a  wave  with  a  period  of  approximately  6  hours.  The  intrinsic 
zonal wavelength of this wave was 772 km, and the intrinsic zonal phase velocity was 35 m s⁻¹.
This wave appears to be responsible for the 6-hour period density variations observed with the
ground-based lidar. The 6-hour period variations observed with both the ground-based and
airborne lidars were dominant only on the bottomside of the layer (80-90 km). The radar
observations also showed a dominant horizontal wind variation with a period of 6 hours. In the
altitude region which corresponds to the bottomside of the Na layer, the amplitudes of the 6-hour
period zonal winds were much larger than the amplitudes of the 6-hour period meridional winds.
The 6-hour period zonal winds were most dominant at the altitude of 88 km, and the amplitude at
this altitude was approximately 9 m s⁻¹. The horizontal wind amplitude of the 6-hour period
wave observed with the ground-based lidar was estimated from the vertical wavenumber power
spectrum to be 17 m s⁻¹.

Both  the  ground-based  and  airborne  lidar  observations  revealed  another  strong  variation 
in  the  Na  layer  with  a  period  of  about  2  hours.  The  intrinsic  zonal  wavelength  of  the  2-hour 
period  wave  observed  during  the  eastward  flight  was  263  km,  and  the  intrinsic  zonal  phase 
velocity was 43 m s⁻¹. The vertical phase velocity of the 2-hour period wave observed with the
ground-based lidar was 1.8 m s⁻¹, and the vertical wavelength was estimated to be 12 km. These
2-hour period waves were again dominant only on the bottomside of the Na layer, and are believed
to be propagating westward.
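
The wavelengths, periods, and phase velocities summarized above are tied together by the
relation phase velocity = wavelength / period. The short sketch below, with helper names that
are hypothetical and introduced only for this illustration, checks that the quoted values are
mutually consistent. The 102 min period is the intrinsic period quoted earlier in this chapter
for the 2-hour wave.

```python
def phase_speed_m_s(wavelength_km, period_minutes):
    """Phase speed in m/s from a wavelength in km and a period in minutes."""
    return wavelength_km * 1000.0 / (period_minutes * 60.0)

def implied_period_minutes(wavelength_km, speed_m_s):
    """Period in minutes implied by a wavelength in km and a phase speed in m/s."""
    return wavelength_km * 1000.0 / speed_m_s / 60.0

# 2-hour wave: 263 km zonal wavelength, 102 min intrinsic period.
zonal_speed_2h = phase_speed_m_s(263.0, 102.0)

# 6-hour wave: 772 km zonal wavelength, 35 m/s zonal phase velocity.
period_6h_hours = implied_period_minutes(772.0, 35.0) / 60.0

# 2-hour wave, vertical: 12 km wavelength, 1.8 m/s vertical phase velocity.
period_vertical_min = implied_period_minutes(12.0, 1.8)

print(f"{zonal_speed_2h:.0f} m/s, {period_6h_hours:.1f} h, {period_vertical_min:.0f} min")
```

The three results come out near 43 m s⁻¹, 6 hours, and 2 hours, in line with the values
listed in the summary.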




5. LIDAR OBSERVATIONS OF SPORADIC SODIUM LAYERS
AT MAUNA KEA OBSERVATORY, HAWAII


5.1  Introduction 

The  upper  atmospheric  Na  layer  has  been  explored  with  lidar  systems  for  almost  two 
decades. The layer is generally confined to an altitude range of 80-110 km, with a peak density
near 92 km of about 10³-10⁴ cm⁻³. Meteoric ablation is believed to be the dominant source of
the Na layer [Jegou et al., 1985a]. Lidar studies have shown that the layer is an excellent tracer
of wave motions. The layer profile is particularly sensitive to gravity and tidal wave effects,
because the wave amplitudes are usually large near the mesopause and the steep Na density
gradients on the bottom and topsides of the layer tend to enhance the observed wave
perturbations. An extensive analysis of the monochromatic gravity waves observed in the Na
layer above Urbana, Illinois (40°N, 88°W), was recently reported by Gardner and Voelz [1987].
A study of tidal waves observed above the same location has also been reported by Kwon et al.
[1987]. In addition to exploring wave dynamics, lidar studies have also shown that the layer can
be used to measure the temperature of the upper mesosphere and lower thermosphere. The
temperature profiles are obtained from measurements of the Doppler-broadened resonance line
width of the Na atoms [Gibson et al., 1979]. Recently, Neuber et al. [1988] reported extensive
wintertime measurements of the temperature profile near the mesopause, using a Na lidar at
Andoya, Norway (69°N, 16°E).

Lidar  measurements  at  low-  and  high-latitude  sites  have  occasionally  exhibited  the 
sporadic  developments  of  very  dense  narrow  Na  layers.  In  the  late  1970s,  Clemesha  et  al. 
[1978]  were  the  first  to  report  such  observations  at  Sao  Paulo,  Brazil  (23®S,  46®W).  More 
recently, the observations of von Zahn et al. [1987] and von Zahn and Hansen [1988] at
Andoya, Norway, have generated considerable interest in this intriguing phenomenon. Gardner
et al. [1988] have also recently reported similar measurements of sporadic Na layers above
Longyearbyen, Svalbard (78°N, 15°E). The peak densities of these sporadic layers sometimes
exceed 10⁴ cm⁻³ and can be as much as 10 times higher than the peak densities of the normal Na
layer. The thicknesses of the sporadic layers are very small, typically of the order of 1 km
FWHM. Because development times are usually less than a few tens of minutes, these layers
have also been referred to as "sudden layers" [von Zahn et al., 1987]. In this chapter, the
characteristics of 16 sporadic Na layers observed at the low-latitude site of Mauna Kea, Hawaii
(19°50'N, 155°28'W), during 5 nights of observations in January 1987 will be discussed.

5.2  Observations 

From  January  17  to  22, 1987,  the  University  of  Illinois  (UIUC)  lidar  system  was  used  to 
study  the  Na  layer  above  Mauna  Kea  Observatory.  The  temporal  resolution  of  the  lidar  was 
100 s, and the vertical resolution was 150 m. All the measurements were made at zenith, and the
laser beam divergence was 0.4 mrad full width at the e⁻² intensity point. The beam illuminated a
horizontal circular area in the Na layer, with a diameter of about 40 m. The lidar system is
described in more detail by Thompson and Gardner [1987]. During the 5 nights of the
campaign,  30  hours  of  Na  measurements  were  obtained,  and  a  total  of  16  sporadic  layers  were 
observed.  The  system  operation  periods  and  statistics  of  the  sporadic  layers  are  listed  in  Table 
5.1.  At  least  one  sporadic  layer  developed  during  each  night,  except  on  January  17,  when  only 
1.5  hours  of  observations  were  obtained.  On  the  night  of  January  21,  nine  sporadic  layers 
developed  within  a  period  of  2  hours,  between  2100  and  2300  LST.  To  illustrate  the  behavior  of 
these  sporadic  layers,  a  sequence  of  density  profiles  is  plotted  in  Figure  5.1.  A  normal  density 
profile,  obtained  at  2133  LST,  is  plotted  in  Figure  5.1a.  A  small  sporadic  Na  enhancement  first 
appeared  near  105  km  at  2135  LST.  Then,  four  narrow  layers  formed  quickly  between  2200  and 
2217  LST  in  the  altitude  region  between  95  and  100  km.  The  density  profile  obtained  at  2218 
LST, including two of these sporadic layers, is plotted in Figure 5.1b. The layer at 97 km
moved downward with an apparent velocity of 2.7 m s⁻¹ and merged with the lower layer near


Table 5.1. Statistics of Na Sporadic Layers Observed
at Mauna Kea, Hawaii

Date                   Measurement    Measurement    Number of
                       Time           Duration       Sporadic
                       (LST)          (hour:min)     Layers

January 17, 1987       0127-0252       1:25           0
January 18, 1987       0153-0620       4:27           1
January 19, 1987       2115-2118       0:03           1
January 20, 1987       0053-0622       5:29           2
January 20-21, 1987    2052-0618       9:26           2
January 21-22, 1987    2059-0601       9:02          10

Total                                 29:53          16

Figure 5.1. Sodium density profiles measured at a) 2133 LST, b) 2218 LST, and c) 2224 LST
at Mauna Kea Observatory on January 21, 1987.




95 km at 2224 LST, as seen in Figure 5.1c. The lower layer reached a maximum density of 2.8
× 10⁴ cm⁻³, with a thickness of 2.0 km FWHM. Although the peak density decreased after 2224
LST, this layer was observed continuously for almost 8 hours. During the night, additional
sporadic layers formed near 103 km, as seen in Figure 5.2. The dominant sporadic layer moved
downward from about 96 to 87 km, with a remarkably constant velocity of 31.4 cm s⁻¹. The
altitude of this layer is plotted versus time in Figure 5.3. The straight line drawn in Figure 5.3 is
a linear regression fit, which was used to estimate the average vertical velocity. When the layer
descended to an altitude of 91 km at 0255 LST, the peak density began increasing from
approximately 3000 cm⁻³, reaching a maximum density of about 9280 cm⁻³ at 0407 LST, as
seen in Figure 5.4.
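
The descent of the dominant layer from about 96 to 87 km at 31.4 cm s⁻¹ can be compared with
the statement that the layer was observed for almost 8 hours. The sketch below is only an
arithmetic check; the function name is hypothetical and a strictly constant velocity over the
whole descent is assumed.

```python
def descent_time_hours(start_km, end_km, speed_cm_s):
    """Time in hours to descend from start_km to end_km at a constant speed in cm/s."""
    distance_m = (start_km - end_km) * 1000.0
    speed_m_s = speed_cm_s / 100.0
    return distance_m / speed_m_s / 3600.0

# Values quoted in the text: 96 km to 87 km at 31.4 cm/s.
hours = descent_time_hours(96.0, 87.0, 31.4)
print(f"{hours:.1f} h")
```

Under that assumption the descent takes close to 8 hours, consistent with the observed
lifetime of the layer.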

The  peak  density  of  the  dominant  layer  is  plotted  versus  time  in  Figure  5.5.  When  the 
peak density first reached the maximum at 2224 LST, the column abundance of the sporadic
layer was 6.7 × 10⁹ cm⁻², a value which was 74% of the column abundance of the entire Na
layer. About 13 min before the sporadic layer began forming, the Na abundance in the region of
the layer was only 1.1 × 10⁹ cm⁻², and the column abundance of the entire Na layer was 3.5 ×
10⁹ cm⁻². Thus when it was most prominent, the sporadic layer contained a substantial amount
of Na, which almost doubled the abundance of the entire layer. The rapid growth and decay of
the first maximum of this sporadic layer lasted for only 40 min. The average Na production and
decay rates were 28.4 and 12.6 cm⁻³ s⁻¹, respectively. The maximum production and decay rates
were 60.9 and 68.0 cm⁻³ s⁻¹, respectively. Recently, von Zahn and Hansen [1988] defined a
parameter, called the strength factor, to classify the sporadic layers that they observed. The
strength  factor  is  defined  as  the  ratio  of  the  maximum  peak  density  of  the  sporadic  layer  to  the 
density  of  the  normal  layer  at  the  altitude  of  the  peak  of  the  sporadic  layer.  The  strength  factor  of 
the  sporadic  layer  illustrated  in  Figure  5.1c  was  13.8.  The  second  rapid  enhancement  of  this 
layer  occurred  about  6  hours  after  the  initial  formation.  The  average  Na  production  and  decay 


Figure 5.2. Sodium density profile measured at 2306 LST on January 21, 1987 at Mauna Kea
Observatory.

Figure 5.3. Temporal variation of the altitude of the dominant sporadic Na layer observed
January 21-22, 1987 at Mauna Kea Observatory. The straight line is a linear
regression fit which was used to estimate the average vertical velocity.

Figure 5.4. Sodium density profile measured at 0407 LST on January 22, 1987 at Mauna Kea
Observatory.

Figure 5.5. Temporal variation of the density at the peak of the dominant sporadic Na layer
observed on January 21-22, 1987 at Mauna Kea Observatory.


rates for the second maximum were 1.0 and 2.2 cm⁻³ s⁻¹, respectively, and the strength factor
was 6.8.
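
The column abundance figures quoted for the first maximum can be cross-checked against each
other. The sketch below works in a common unit and assumes that the three quoted abundances
share the same power of ten; that assumption, and the variable names, are introduced here for
illustration only. It estimates the total column abundance at the peak in two ways: from the
74% fraction, and by adding the growth of the sporadic layer region to the earlier total.

```python
# Quoted values, all expressed in the same (assumed common) unit of column abundance.
sporadic_at_peak = 6.7    # sporadic layer at 2224 LST
fraction_of_total = 0.74  # its share of the entire Na layer at that time
region_before = 1.1       # same altitude region about 13 min before formation
total_before = 3.5        # entire Na layer at that earlier time

# Estimate 1: total at the peak implied by the 74% fraction.
total_from_fraction = sporadic_at_peak / fraction_of_total

# Estimate 2: earlier total plus the Na gained in the sporadic layer region.
total_from_growth = total_before + (sporadic_at_peak - region_before)

print(f"{total_from_fraction:.2f} vs {total_from_growth:.2f}")
```

The two estimates agree to within about one percent, which suggests that nearly all of the
increase in total abundance occurred inside the sporadic layer itself.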

The  full,  top-half,  and  bottom-half  widths  of  the  dominant  sporadic  layer  are  plotted 
versus  time  in  Figure  5.6.  The  widths  were  determined  at  80%  of  the  peak  density.  When 
examining  these  widths,  it  is  helpful  to  remember  that  the  full  width  determined  at  the  80%  points 
is 1.34 times larger than the rms width of a Gaussian layer. The full width was very narrow, of
the order of 1 km, during the development periods of the two maximum peak densities. Then, as
the peak density decreased, the full width became larger. The average full width measured over
the total observation period was 1.25 km. The increase of the full width from 0100 to 0300 LST
was largely due to an increase of the top-half width. During this period the full width increased
at the rate of 14.6 cm s⁻¹, the top-half width increased at the rate of 11.9 cm s⁻¹, and the
bottom-half width increased at only 2.7 cm s⁻¹. After the second maximum peak density near 0410
LST, both top- and bottom-half widths began growing, resulting in a relatively fast growth of the full
width.  The  average  top-  and  bottom-half  widths  over  the  total  observation  period  were  0.71  and 
0.54  km,  respectively. 
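
The factor of 1.34 relating the full width at the 80% points to the rms width follows from
the Gaussian profile exp(-z²/2σ²), whose full width at a fraction p of the peak is
2σ sqrt(2 ln(1/p)). The sketch below evaluates this for p = 0.8 and p = 0.5; the function
name is hypothetical, and treating the layer as Gaussian to convert the 1.25 km average full
width into an FWHM is an assumption of this illustration.

```python
import math

def full_width_over_rms(level):
    """Full width of a Gaussian at a given fraction of its peak, in units of the rms width."""
    return 2.0 * math.sqrt(2.0 * math.log(1.0 / level))

factor_80 = full_width_over_rms(0.8)    # full width at 80% of peak
factor_half = full_width_over_rms(0.5)  # full width at half maximum (FWHM)

# Convert the 1.25 km average full width (80% points) to rms width and FWHM,
# assuming a Gaussian layer shape.
rms_km = 1.25 / factor_80
fwhm_km = rms_km * factor_half

print(f"{factor_80:.2f}, {factor_half:.3f}, rms {rms_km:.2f} km, FWHM {fwhm_km:.1f} km")
```

The quoted rates and average widths are also additive, as expected: 11.9 + 2.7 = 14.6 cm s⁻¹
and 0.71 + 0.54 = 1.25 km.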

Although  the  broadening  of  the  sporadic  layer  might  suggest  that  the  Na  diffused  into  the 
normal  layer,  most  of  the  Na  atoms  apparently  disappeared  as  the  peak  density  of  the  sporadic 
layer  decreased.  The  column  abundance  of  the  entire  Na  layer  observed  on  the  night  of  January 
21 is plotted in Figure 5.7. The maximum near 2220 LST occurred when the peak density of the
sporadic  layer  was  at  its  first  maximum.  After  this  maximum  the  column  abundance  decreased 
rapidly  as  the  peak  density  of  the  sporadic  layer  decreased.  Therefore  it  appears  that  most  of  the 
Na  in  the  sporadic  layer  simply  disappeared  and  was  not  redistributed  throughout  the  rest  of  the 
Na  layer. 

In  addition  to  the  dominant  layer,  another  sporadic  layer  formed  near  103  km  at  2251 
LST (Figure 5.2). The peak density and altitude of this sporadic layer are plotted versus time in
Figure 5.8. The highest peak density was 3390 cm⁻³. The strength factor was 16.3, a value


Figure 5.6. Temporal variations of the widths of the dominant sporadic Na layer observed on
January 21-22, 1987 at Mauna Kea Observatory.

Figure 5.7. Temporal variation of the column abundance measured on January 21-22, 1987 at
Mauna Kea Observatory.

Figure 5.8. Temporal variations of the peak density and altitude of the high altitude sporadic Na
layer observed on January 21, 1987 at Mauna Kea Observatory.


which is higher than that of the dominant sporadic layer. The higher strength factor was due to
the very small density of the normal Na layer at this high altitude. From 2251 to 2301 LST the
peak density of this sporadic layer increased rapidly as the peak altitude moved downward.
Then, from 2301 to 2330 LST, the peak density stabilized at a value near 2800 cm⁻³ as the peak
altitude also stabilized near 102 km. During the rapidly changing period the Na production rate
and vertical velocity were 5.2 cm⁻³ s⁻¹ and -1.8 m s⁻¹, respectively. During the stable period the
Na density decreased slightly. The decay rate and vertical velocity of the layer were 0.1 cm⁻³ s⁻¹
and -22.8 cm s⁻¹, respectively.
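
Because the strength factor is the ratio of the sporadic layer peak density to the normal
layer density at the same altitude, the quoted peak densities and strength factors imply a
background density for each case. The sketch below inverts the definition for two of the
layers described above; the function name is hypothetical and the results are implied values,
not measurements reported in the text.

```python
def implied_background_density(peak_density_cm3, strength_factor):
    """Normal-layer density implied by a sporadic peak density and its strength factor."""
    return peak_density_cm3 / strength_factor

# High altitude layer near 103 km: peak 3390 cm^-3, strength factor 16.3.
background_high = implied_background_density(3390.0, 16.3)

# Second maximum of the dominant layer near 90 km: peak 9280 cm^-3, strength factor 6.8.
background_low = implied_background_density(9280.0, 6.8)

print(f"{background_high:.0f} cm^-3 near 103 km, {background_low:.0f} cm^-3 near 90 km")
```

The implied background near 103 km is several times smaller than that near 90 km, which is
why the weaker high altitude layer still has the larger strength factor.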

Another sporadic layer with a high peak density formed near 97 km at 2217 LST (Figure
5.1b). This layer lasted for 6 min and was present in only four density profiles. The layer
moved downward rapidly and merged with the dominant layer at 95 km. The apparent vertical
velocity of the peak was remarkably constant at -2.7 m s⁻¹. The altitude of the layer is plotted
versus time in Figure 5.9. The four measurement points are represented by the crosses. The
highest peak density of this short-lived layer was 17,680 cm⁻³, and the maximum Na production
rate was 92.7 cm⁻³ s⁻¹.

The  16  sporadic  layers  are  organized  in  decreasing  order  of  the  highest  peak  densities  and 
then  numbered.  The  characteristics  of  each  layer  are  listed  in  Appendix  3.  The  characteristics  of 
the  dominant  sporadic  layer  of  January  21-22  are  divided  into  two  groups  (1  and  5), 
corresponding  to  the  two  development  periods  near  2200  and  0300  LST. 

The  most  significant  pattern  of  the  sporadic  layers  appears  to  be  the  occurrence  times. 
Lidar  observations  were  conducted  near  0300  LST  on  4  days  of  the  campaign.  During  3  of  these 
4 days, four sporadic layers (5, 7, 11, and 14) began forming within the short time span of 15
min from 0253 to 0308 LST. Even on January 21, when narrow sporadic layers did not form
during this period, a rapid increase of Na density between 90 and 95 km was observed near 0300
LST. The altitudes of the sporadic layers are plotted versus time in Figure 5.10. A total of 10
sporadic  layers  began  forming  between  2100  and  2300  LST,  but  no  layers  formed  between 


Figure 5.9. Temporal variation of the altitude of the short-lived sporadic Na layer observed on
January 21, 1987 at Mauna Kea Observatory. Crosses represent the four
measurement points.

Figure 5.10. Temporal variations of the altitudes of the sporadic Na layers measured from
January 18 to 22, 1987 at Mauna Kea Observatory. Dots represent the times and
altitudes of the maximum peak densities. Squares, triangles, circles, and
diamonds represent starting times and altitudes. The sporadic layers are numbered
in accordance with Appendix 3.


2300 and 0250 LST. In Table 5.2 the characteristics of the layers which formed between 2100
and 2300 LST are compared with those of the layers which formed near 0300 LST. The
difference in the mean times of the maximum peak densities of the two groups of layers was
about 6 hours. In general, the layers of the early morning had lower starting altitudes, longer
durations, and longer formation and dissipation periods than the layers of the late evening. A
total of nine sporadic layers began forming in the altitude range between 94 and 99 km, two
between 91 and 93 km, and three between 100 and 104 km. The highest and lowest peak
altitudes were 104.4 km (layer 16) and 86.6 km (layer 5), respectively. A total of 10 layers
moved downward, four moved upward, and one moved upward and then downward. The three
longest duration layers (1, 5, and 7) moved downward, with mean apparent velocities of about
30 cm s⁻¹. Six layers moved at very high velocities, ranging from 1.6 to 2.9 m s⁻¹. All six of
these layers were observed on January 21 when the dominant layer developed. In general, the
top-half widths were larger than the bottom-half widths, and the broadening speeds of the
top-half widths were also considerably faster than those of the bottom-half widths. The formation
periods were generally shorter than the decay periods.
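
The "about 6 hours" separation between the two groups can be recovered from the mean times of
maximum density listed in Table 5.2 (2217 LST and 0359 LST). The sketch below handles the
wrap past midnight explicitly; the helper names are hypothetical.

```python
def minutes_since_midnight(hhmm):
    """Convert a four-digit local time string such as '2217' to minutes since midnight."""
    return int(hhmm[:2]) * 60 + int(hhmm[2:])

def elapsed_minutes(start_hhmm, end_hhmm):
    """Minutes from start to end, allowing the interval to cross midnight once."""
    difference = minutes_since_midnight(end_hhmm) - minutes_since_midnight(start_hhmm)
    return difference % (24 * 60)

# Mean times of maximum density for the late evening and early morning groups (Table 5.2).
gap_minutes = elapsed_minutes("2217", "0359")
print(f"{gap_minutes // 60} h {gap_minutes % 60} min")
```

The interval is 5 hours 42 minutes, which the text rounds to about 6 hours.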

The  general  characteristics  of  the  prominent  sporadic  layers  observed  in  Sao  Paulo, 
Brazil, Andoya, Norway, and Hawaii are compared in Table 5.3. The prominent layer of Sao
Paulo  is  chosen  from  two  reported  events  in  the  literature  [Clemesha  et  al.,  1978;  1980],  and  that 
of  Andoya  from  10  reported  events  [von  Zahn  and  Hansen,  1988].  The  prominent  sporadic 
layers  above  Hawaii  and  Sao  Paulo  were  observed  during  winter  periods,  whereas  those  above 
Andoya  were  observed  during  summer.  Most  of  the  characteristics  are  surprisingly  similar.  In 
fact,  the  starting  altitudes  are  ~  95  km,  and  peak  densities  and  vertical  velocities  are  all 
comparable. 


Table 5.2. Comparison of Sporadic Na Layers Observed in the Late
Evening and Early Morning at Mauna Kea, Hawaii

                                           Late Evening           Early Morning
                                           Occurrences            Occurrences

Number of Observations                     12                     4
Mean Starting Time                         2207 LST (±38 min)     0301 LST (±7 min)
Mean Starting Altitude (km)                97.8 (±3.8)            93.8 (±2.2)
Mean Time of Maximum Density               2217 LST (±42 min)     0359 LST (±42 min)
Mean Altitude of Maximum Density (km)      97.2 (±3.2)            92.6 (±1.9)
Mean Duration (min)ᵃ                       55.2 (±73.3)           137.5 (±72.5)
Mean Formation Period (min)                6.1 (±4.1)             33.9 (±30.3)
Mean Dissipation Period (min)              18.3 (±18.3)           51.8 (±30.8)

ᵃIn some cases duration was larger than sum of formation and dissipation periods because
maximum density of the sporadic layer was maintained for several minutes.


Table 5.3. Characteristics of the Most Prominent Sporadic
Na Layers Reported in Literature

Location                      Sao Paulo,        Andoya,           Mauna Kea,
                              Brazilᵃ           Norwayᵇ           Hawaiiᶜ
                              (23°S, 46°W)      (69°N, 16°E)      (20°N, 155°W)

Date                          Aug. 26, 1979     Aug. 20, 1986     Jan. 21-22, 1987
Starting Time                 0100 LST          2209 LST          2211 LST
Absolute Peak Density         4 × 10⁴ cm⁻³      2.3 × 10⁴ cm⁻³    2.8 × 10⁴ cm⁻³
Strength Factor               12                24                14
Width                         2 km              0.5 km FWHM       2 km FWHM
Formation Period              15 min            7 min             13 min
Duration of Sporadic Layer    30 min            4 hour            8 hour
Starting Altitude             95 km             95 km             95 km
Vertical Velocity of Peak     NA                -28 cm s⁻¹        -31 cm s⁻¹

NA, not available.
ᵃFrom Clemesha et al. [1980].
ᵇFrom von Zahn et al. [1987] and von Zahn and Hansen [1988].
ᶜFrom this paper.


5.3  Discussion 

The  mechanisms  responsible  for  creating  the  sporadic  Na  layers  have  been  the  subject  of 
much  speculation.  On  the  one  hand,  von  Zahn  and  Hansen  [1988]  suggested  that  the  sporadic 
layers  were  formed  when  auroral  excitation  caused  Na  to  be  evaporated  from  the  surfaces  of 
mesospheric  dust  particles.  On  the  other  hand,  Clemesha  et  al.  [1980]  suggested  that  the 
sporadic  layers  were  vapor  clouds  created  by  meteoric  ablation.  Both  groups  noted  that  the 
prevailing  winds  would  rapidly  transport  the  sporadic  layers  horizontally.  By  making 
measurements  with  a  steerable  lidar  system  and  two  photometers  located  at  two  sites  separated  by 
107  km,  Clemesha  et  al.  [1980]  estimated  that  the  horizontal  velocities  of  the  sporadic  layers 
observed at Sao Paulo were of the order of 200 m s⁻¹. The photometer measurements exhibited
increases  in  Na  D-line  intensities  during  the  occurrence  of  the  sporadic  layers.  However,  no 
changes  in  atomic  oxygen  and  hydroxyl  emissions  were  observed.  In  November  1987, 
increases  of  Na  D-line  intensities  were  also  measured  at  Longyearbyen,  Svalbard,  during  the 
occurrence  of  a  sporadic  Na  layer  that  was  observed  with  the  UIUC  lidar  [Gardner  et  al.,  1987, 
and Roger Smith, University of Alaska, private communication, 1987].

The  sporadic  Na  layers  appear  to  be  related  to  sporadic  E  layers.  Simultaneous 
occurrences of sporadic E and Na layers at almost identical altitudes were reported by both von
Zahn and Hansen [1988] and Clemesha et al. [1980]. Both groups argued that the mechanisms
creating the sporadic E layers were also responsible for the sporadic Na layers. However, the
argument does not explain completely the sporadic E and Na layers observed in Hawaii. During
the Na lidar campaign, an ionosonde was operated on Maui (20°48'N, 156°30'W), which is
located about 150 km northwest of the lidar site. Ionograms were routinely recorded every 15
min. On a total of eight occasions, sporadic E layers formed above Maui at almost the same
times and altitudes as the sporadic Na layers above the lidar site at Mauna Kea. Seven of these
eight sporadic Na layers formed within a short period of 50 min on the night of January 21, in
the  altitude  region  from  94.5  to  103.35  km.  However,  during  the  occurrences  of  the  remaining 




eight  sporadic  Na  layers,  no  sporadic  E  layers  were  observed.  For  example,  sporadic  E  layers 
appeared  during  the  development  period  of  the  dominant  sporadic  Na  layer  of  January  21,  but  no 
sporadic  E  layers  appeared  when  the  density  of  this  Na  layer  began  increasing  to  its  second 
maximum  near  0300  LST.  In  fact,  for  the  four  sporadic  Na  layers  that  formed  in  the  early 
morning  near  0300  LST  no  corresponding  sporadic  E  layers  were  observed.  Furthermore, 
sporadic  E  layers  often  formed  above  Maui  when  no  sporadic  Na  layers  were  observed  above 
Mauna  Kea. 

The winds induced by diurnal tides appear to be related to the formation of some of the
sporadic Na layers observed in Hawaii. The three longest duration sporadic Na layers (1, 5, and
7 in Appendix 3) had average downward vertical velocities of 28, 33, and 35 cm s⁻¹. During
winter the diurnal tide has been observed to be dominant at the low-latitude sites of Punta
Borinquen (18°N) [Bernard et al., 1981] and Arecibo (18°N) [Mathews, 1976]. The estimated
vertical wavelengths of the diurnal tides range from 25 to 30 km at the altitudes of the Na layer.
The corresponding vertical phase velocities of the diurnal tide are estimated to range between 29
and 35 cm s⁻¹. The three long-lived sporadic layers observed at Mauna Kea appear to move
downward with these same velocities. Another indication that there is a link between the tidally
induced winds and the sporadic Na layers is the similarity in the Na layer profiles measured on
different days. In Figure 5.11 are plotted Na density profiles measured near 0300 LST on
January 18, 21, and 22. The main features of the profiles such as the sharp increase of density
near 90 km and slow decrease above 90 km are apparent in all the profiles. The winds induced
by the diurnal tide may be responsible for this daily repeated profile shape. Sporadic Na layers
developed near 90 km shortly after the profiles of January 18 and 22 were measured. For
example, on January 22 the peak at 90 km developed into sporadic layer 5 about 2 min later.
Five hours earlier this peak was near 96 km and had developed into the dominant sporadic layer
plotted in Figure 5.1c. The peak moved downward at a velocity of 31.4 cm s⁻¹.
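
The 29 to 35 cm s⁻¹ range quoted for the diurnal tide follows from dividing the vertical
wavelength by the 24 hour period. The sketch below reproduces that arithmetic; the function
name is hypothetical, and a period of exactly 24 hours is assumed.

```python
def vertical_phase_speed_cm_s(vertical_wavelength_km, period_hours=24.0):
    """Vertical phase speed in cm/s for a wave of the given vertical wavelength and period."""
    wavelength_cm = vertical_wavelength_km * 1.0e5
    period_s = period_hours * 3600.0
    return wavelength_cm / period_s

# Diurnal tide with vertical wavelengths of 25 and 30 km.
slow = vertical_phase_speed_cm_s(25.0)
fast = vertical_phase_speed_cm_s(30.0)
print(f"{slow:.1f} to {fast:.1f} cm/s")
```

The 31.4 cm s⁻¹ descent rate of the dominant layer falls inside this range.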


Figure 5.11. Sodium density profiles measured at 0256 LST on January 18, 0301 LST on
January 21, and 0255 LST on January 22, 1987.


Although  the  vertical  velocities  of  the  long-lived  sporadic  layers  were  comparable  with 
the  phase  velocity  of  the  diurnal  tide,  the  periods  and  vertical  wavelengths  of  the  waves 
responsible for the observed velocities could not be determined from the Na lidar data. The
altitudes of local density maxima observed on the night of January 21 are plotted in Figure 5.12.
The measurement times of the density profiles of Figures 5.1b, 5.1c, 5.2, and 5.4 are marked on
the bottom of Figure 5.12. The dominant sporadic layer that started near 95 km at 2200 LST
moved steadily downward during the measurement period. Note that other density maxima also
moved downward, with similar velocities. The maximum near 91 km, starting at 2200 LST,
moved downward for about 2.5 hours, with a velocity of 32 cm s⁻¹; the maximum near 108 km,
starting at 0200 LST, moved downward for about 3 hours, with a velocity of 38 cm s⁻¹. The
maximum near 98 km, starting at 0430 LST, also moved downward for about 1.5 hours, with a
velocity of 33 cm s⁻¹. If a diurnal tide with a vertical wavelength of 25-30 km was dominating
the  Na  layer  dynamics,  the  Na  density  maxima  created  by  the  tidally  induced  winds  would  move 
downward  with  the  diurnal  vertical  phase  velocities.  However,  it  is  not  clear  how  the  tidally 
induced  winds  would  cause  the  sporadic  Na  layers  to  move  downward  with  the  same  velocity. 
On  the  night  of  January  21,  there  were  at  least  these  four  groups  of  density  maxima,  including 
the  dominant  sporadic  layer,  moving  downward  at  about  28-39  cm  s-^.  The  average  vertical 
distance  between  these  groups  of  the  maxima  was  about  6  km  between  2210  and  0050  LST, 
about  15  km  between  0210  and  0500  LST,  and  about  9  km  between  0430  and  0600  LST. 
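
As a rough check on the comparison with the diurnal tide, the downward phase speed of a wave is its vertical wavelength divided by its period. The sketch below assumes a 24-hour period for the diurnal tide and uses the 25-30 km vertical wavelengths quoted in the text; the function name is illustrative only.

```python
# Vertical phase speed of a wave: vertical wavelength divided by period.
# Assumption: the diurnal tide has a period of 24 hours.

def vertical_phase_speed_cm_s(vertical_wavelength_km, period_hours):
    """Return the vertical phase speed in cm/s."""
    wavelength_cm = vertical_wavelength_km * 1.0e5   # km -> cm
    period_s = period_hours * 3600.0                 # hours -> s
    return wavelength_cm / period_s

DIURNAL_PERIOD_HOURS = 24.0

low = vertical_phase_speed_cm_s(25.0, DIURNAL_PERIOD_HOURS)
high = vertical_phase_speed_cm_s(30.0, DIURNAL_PERIOD_HOURS)
print(f"{low:.1f} to {high:.1f} cm/s")  # 28.9 to 34.7 cm/s
```

Most of the observed downward velocities of 28-39 cm s⁻¹ fall within or close to this range.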

During the formation periods, the apparent vertical movements of the sporadic layers also
appear to be influenced by the prevailing horizontal winds. During the formation periods of 10
sporadic layers, upward motions of the sporadic layers and Na density maxima were
observed. The maximum velocity of the apparent upward motions was about 1.6 m s⁻¹.
Because this upward velocity was quite large and because the upward motions were observed to
change into downward motions within a very short period of a few minutes, vertical winds are
probably not responsible for the motions. Instead, the advection of the tilted layers across the


[Figure 5.12 plot: altitude (km) versus local standard time, January 21-22, 1987]


Figure 5.12. Temporal variations of the altitudes of local Na density maxima on January 21-22,
1987 at Mauna Kea Observatory. The measurement times of the density profiles
of Figures 5.1b, 5.1c, 5.2, and 5.4 are marked on the bottom of this figure.




lidar site by prevailing horizontal winds might be responsible for the apparent motions. To
illustrate, the altitudes of the density maxima and minima observed on the night of January 21 are
plotted in detail from 2100 to 2330 LST in Figure 5.13. The measurement times of the density
profiles of Figures 5.1a, 5.1b, 5.1c, and 5.2 are marked on the bottom of Figure 5.13. The
altitudes of the sporadic layers are represented by solid lines, the altitudes of other density
maxima by dashed lines, and the altitudes of density minima by dotted lines. During this period
a total of nine sporadic layers were observed. From 2140 to 2223 LST, three sporadic layers (2,
4, and 16) moved downward at an average velocity of 2 m s⁻¹ in the altitude region between 96
and 105 km. Then, as the downward movements disappeared, the dominant sporadic layer (1)
reached the highest peak density at 2224 LST. From 2226 to 2251 LST, two sporadic layers (9
and 15), along with other density maxima, moved upward at an average velocity of 1.6 m s⁻¹.
The dominant layer also moved upward at a velocity of approximately 1.6 m s⁻¹ from 2215 to
2229 LST. Between 2229 and 2231 LST the layer was displaced from 96.5 to 95.6 km. Then,
from 2231 to 2242 LST the layer moved upward again, at a velocity of approximately 1.5 m s⁻¹.
After 2242 LST the layer began moving downward at 31 cm s⁻¹ and maintained this velocity for
the next 7 hours. Sporadic layer 12 near 103 km began developing at 2251 LST, when the
upward motion of sporadic layer 9 changed to downward motion.
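
The advection argument can be made concrete with simple geometry: if a tilted layer is carried horizontally across a vertically pointing lidar beam, the altitude seen by the lidar changes at a rate equal to the horizontal wind speed times the slope of the layer. The sketch below is illustrative only; the 50 m s⁻¹ horizontal wind is a hypothetical value chosen for the example and is not taken from the measurements.

```python
import math

# Apparent vertical velocity seen by a fixed, vertically pointing lidar when a
# tilted layer is advected horizontally: w_apparent = u_horizontal * slope.

def required_slope(apparent_vertical_m_s, horizontal_wind_m_s):
    """Layer slope (rise over run) needed to produce the apparent vertical velocity."""
    return apparent_vertical_m_s / horizontal_wind_m_s

APPARENT_UPWARD_M_S = 1.6       # maximum apparent upward velocity quoted in the text
HYPOTHETICAL_WIND_M_S = 50.0    # assumed horizontal wind, for illustration only

slope = required_slope(APPARENT_UPWARD_M_S, HYPOTHETICAL_WIND_M_S)
tilt_deg = math.degrees(math.atan(slope))
print(f"slope = {slope:.3f}, tilt = {tilt_deg:.2f} deg")
```

Under this assumed wind speed, a layer tilt of less than 2 degrees would be enough to produce the observed apparent upward motion.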

5.4  Summary 

The  maximum  densities  at  the  peaks  of  the  sporadic  Na  layers  observed  above  Mauna 
Kea were of the order of 10⁴ cm⁻³, and the full widths measured at the 80% points were of the
order  of  1  km.  The  formation  periods  of  the  layers  ranged  from  a  few  minutes  to  about  1  hour. 
Although  the  lidar  campaign  was  conducted  only  on  5  nights,  so  that  the  data  base  is  not  large, 
the  occurrence  times  of  the  sporadic  layers  appear  to  have  a  pattern.  The  layers  formed  either  in 
the  late  evening  between  2100  and  2330  LST  or  in  the  early  morning  between  0300  and  0600 
LST.  On  3  different  nights  four  layers  began  forming  within  a  short  time  span  of  15  min  from 


[Figure 5.13 plot: altitude (km) versus local standard time, January 21, 1987]


Figure 5.13. Temporal variations of the altitudes of local Na density maxima and minima from
2100 to 2330 LST on January 21, 1987 at Mauna Kea Observatory. The solid
curves represent the altitudes of the sporadic layers, the dashed curves the local
density maxima, and the dotted curves the local density minima. The sporadic
layers are numbered in accordance with Table 5.2. The measurement times of
the density profiles of Figures 5.1a, 5.1b, 5.1c, and 5.2 are marked on the
bottom of this figure.




0253 to 0308 LST. A total of 10 layers began forming between 2100 and 2300 LST, but no
layers formed between 2300 and 0250 LST. The average of the times of the maximum density
of the early morning sporadic layers occurred about 6 hours after that of the late evening layers.
In general, the formation times of the early morning layers were longer. The starting altitudes of
the early morning layers were also lower. In addition to the patterns of occurrence times, the
approximately 30 cm s⁻¹ apparent downward velocities of the three longest duration layers
suggest that the mechanisms responsible for creating the layers may be related to the diurnal tide.

The sporadic Na layer phenomena also appear to be latitude dependent, because the layers
have been observed only at low- and high-latitude sites. In fact, at the mid-latitude site of
Urbana, Illinois (40°N, 88°W), only one prominent sporadic Na layer has been observed
during almost 10 years of Na lidar observations. A few minor sporadic Na layers were recently
observed at Urbana in March and April 1988, using a new, more powerful CEDAR lidar system
[Beatty et al., 1988]. However, these layers appear to be meteor trails and were not nearly as
spectacular as those observed at Mauna Kea and in the Arctic. The characteristics of the sporadic
layers observed at the high-latitude site of Andoya and the low-latitude sites of Sao Paulo and
Mauna Kea are very similar. Simultaneous occurrences of the sporadic E and Na layers at almost
identical altitudes were observed on many occasions at all three sites. However, at Mauna
Kea, sporadic E and Na layers were not always observed simultaneously. In fact, for the four
sporadic Na layers of the early morning no corresponding sporadic E layers were observed.
Increases in Na D-line emission intensities were also observed during the occurrences of
sporadic Na layers in Sao Paulo and at the high-latitude site of Longyearbyen, Svalbard.

The  mechanisms  responsible  for  creating  the  sporadic  Na  layers  are  not  well  understood. 
Because of the strong correlation in altitudes and times of the sporadic Na and E layers above
Andoya, von Zahn and Hansen [1988] have argued that both are probably created by the same
mechanism. The dynamic effects of tides and waves can lead to the development of very thin
layers of metallic ions. However, the chemical processes for converting Na⁺ to neutral Na




appear to be too slow to explain the rapid, nearly simultaneous development of sporadic E and Na
layers. Consequently, von Zahn and Hansen [1988] suggest that both layers are formed by the
impact of auroral particles on upper atmospheric dust and smoke particles. They argue that this
process would evaporate and ionize metals which are absorbed on the surfaces of the dust
particles. Clearly, auroral excitation is not responsible for the sporadic Na layers observed above
the low-latitude sites of Mauna Kea and Sao Paulo. The starting times of the 16 layers observed
at Mauna Kea were restricted to the short 2-hour interval between 2052 and 2251 LST and the
15-min interval between 0253 and 0308 LST. It is interesting that the 10 sporadic Na layers
observed on 7 different nights and analyzed by von Zahn and Hansen [1988] also had starting
times restricted to the short 3-hour interval between 2159 and 0043 LST and had formation
periods which were comparable to those of the late evening layers above Mauna Kea. It is
difficult to see how auroral excitation would lead to such restricted occurrence times. It is also
puzzling that sporadic Na layers appear to be very rare at mid-latitudes, while sporadic E is not.
Clearly, additional observations, particularly at low latitudes, are needed to better characterize
this very interesting phenomenon.




6.  LIDAR  OBSERVATIONS  OF  THE  SODIUM  LAYER 
AT  SVALBARD,  NORWAY 


6.1  Overviews 

The UIUC group conducted a total of five Na lidar campaigns at Nordlysstasjonen
(78°12'N, 15°50'E), Svalbard, Norway from July, 1987 to April, 1988. Table 6.1 summarizes
the observation periods of the five campaigns. The seasonal variations of the Na column
abundance measured at Svalbard and Urbana are compared in Figure 6.1. Symbols denote
average values for the observation period, while lines denote the range of values. The dashed lines
indicate the abundance measured at Andoya, Norway, which is located approximately 1000 km
south of Svalbard. The Na abundance measured at Urbana shows a distinct annual oscillation
with a sharp peak in November to January and a broader minimum in the summer months. This
annual oscillation is believed to be related to changes in the mesopause temperature that affect the
reaction rates of the main chemical loss processes for Na [Swider, 1985; Jegou et al., 1985b].
The seasonal variations of the abundance measured at Svalbard also show an annual oscillation
with a broad maximum in January to April followed by a distinct minimum in June. The
minimum abundance observed in June is not well understood, and will be discussed further in
Section 6.2. The maximum abundance observed in January to April at Svalbard is comparable to
the maximum abundance in November to January observed at Urbana.

The seasonal variations of the layer centroid height measured at Svalbard and Urbana are
compared in Figure 6.2. The centroid measured at Urbana shows a semi-annual oscillation with
maxima near equinox in March and September and minima near solstice in June and December.
The centroid measured at Svalbard also shows a similar semi-annual oscillation with maxima
near equinox in March and September and minima near solstice in December and June. During
the winter months, the centroid height observed at Svalbard was generally lower than the




Table 6.1. Sodium Lidar Observation Times at Svalbard (1987-1988)

Campaign  Start Date     Start Time (UT)  End Date       End Time (UT)
#1        July 10        2200             July 11        2255
          July 12        0540             July 12        1045
          July 12        1755             July 12        1835
          July 13        1330             July 14        0005
          July 14        1825             July 14        2320
#2        September 7    0800             September 7    1150
          September 7    2055             September 7    2335
          September 8    0310             September 9    1215
          September 10   1725             September 11   0045
#3        October 31     1600             October 31     1615
          November 7     1340             November 9     2230
          November 11    1855             November 11    2315
          November 13    0800             November 13    0930
#4        January 6      1525             January 6      2225
          January 7      0120             January 9      0110
          January 9      2300             January 13     1350
          January 14     1750             January 14     2035
          January 15     0825             January 15     1225
          January 22     0800             January 22     2200
          January 24     1650             January 24     2150
#5        April 4        2030             April 5        0130
          April 5        2335             April 6        0000
          April 6        2045             April 7        0140
          April 8        0110             April 9        0010
          April 10       2150             April 12       0445
          April 12       1855             April 13       0645
          April 13       1735             April 14       0555
          April 18       1800             April 18       2035
          April 19       0055             April 19       0525



(78°N) and b) at Urbana, Illinois (40°N). Symbols denote average values for the
observation period, while lines denote range of values.


centroid observed at Urbana. This could be the result of a stronger downward motion of the
atmosphere over the high latitude site of Svalbard during the winter months.

In Figure 6.3, the seasonal variations of the layer rms width measured at Svalbard and
Urbana are plotted. The rms width measured at Urbana exhibits no clear seasonal variations.
However, the rms width measured at Svalbard shows an annual oscillation with a maximum near
February and a minimum near September. The mechanisms responsible for this annual
oscillation are not well understood.

Sporadic  Na  layers  were  observed  at  Svalbard  during  every  campaign  except  the  July 
1987  campaign.  The  characteristics  of  these  sporadic  Na  layers  are  quite  similar  to  those 
observed  at  Andoya,  Norway  by  von  Zahn  and  Hansen  [1988]  and  at  Mauna  Kea  Observatory, 
Hawaii  by  Kwon  et  al.  [1988].  The  characteristics  of  the  Na  layer  observed  during  the  July  and 
September  1987  campaigns  will  be  presented  in  Section  6.2,  and  the  characteristics  observed 
during  the  November  1987  and  January  1988  campaigns  will  be  presented  in  Section  6.3. 

6.2 Lidar Observations at Svalbard in July and September, 1987

The chemistry and dynamics of the mesospheric Na layer have been studied extensively
since the late 1960s with lidar techniques. Meteoric ablation is generally regarded as the
dominant source of all mesospheric alkali metals including Na. The Na layer is typically
confined to the region between 80 and 110 km with a peak near the mesopause at 90 km, where
the density ranges from about 10³ to 10⁴ cm⁻³. The Na column abundance at mid-latitudes in the
Northern Hemisphere varies from a summer minimum of about 3×10⁹ cm⁻² to a winter maximum
of about 10¹⁰ cm⁻² in December and January [Gardner et al., 1986]. The seasonal and
geographical variations in Na abundance are now believed to be related to changes in the
mesopause temperature that affect the reaction rates of the main chemical loss processes for Na.
In addition to chemical activity, the dynamic effects of the tides, gravity waves and the mean
winds have a significant influence on the vertical structure of the layer.




The system parameters and daytime capabilities of the UIUC Na lidar are described by
Kwon et al. [1987]. Data processing and analysis are discussed by Gardner et al. [1986].
Figure 6.4 is a plot of the average Na profile measured during the period July 10-15, 1987 at
Nordlysstasjonen, Svalbard. Figures 6.5 and 6.6 are plots of the profiles measured near local
midnight on September 7 and 9, 1987, respectively. Table 6.2 lists the major Na layer
parameters for these three profiles. The large standard deviations in the parameters measured
during July are caused by the low Na signal level and the high background noise from the bright
sunlit sky.

Although in July the centroid height and rms thickness are comparable to values
measured at other locations, the peak density and column abundance are almost a factor of 5
lower. These results, which seem to be the first Na measurements made near the North Pole during
summer, are very surprising, particularly when compared with other high latitude observations.
It is well-known that Na abundance is quite high in the wintertime polar mesosphere. Lidar
measurements by Megie et al. [1978] and Juramy et al. [1981] at Heyss Island, Franz Joseph
Land (80°N, 58°E), by von Zahn and Tilgner [1987] at Andoya, Norway (69°N, 16°E) and by
Nomura et al. [1987] at Syowa Station, Antarctica (69°S, 40°E) show Na abundances ranging
from 3×10⁹ to 12×10⁹ cm⁻² during the polar winter. In addition, von Zahn et al. [1988] report
that Na abundance in July at Andoya averaged approximately 6×10⁸ cm⁻², which is consistent
with the July measurements at Nordlysstasjonen approximately 1000 km to the north.

The September 7 profile plotted in Figure 6.5 is also quite interesting because of the high
peak density and small thickness of the layer. The column abundance for this profile is almost
24 times larger than the abundance measured in July. The 2 km half-width and high peak density
are characteristic of the very narrow layers which von Zahn et al. [1988] have observed on
occasion forming rapidly above Andoya. The profile plotted in Figure 6.6 is more typical of the
layer structure observed at Nordlysstasjonen during the September campaign. The column


[Figure 6.5 plot: altitude (km) versus sodium density (cm⁻³), September 7, 1987]


Figure 6.5. Sodium density profile above Nordlysstasjonen, Svalbard at 2152 UT September
7, 1987. The integration period was 14 min, and the data were spatially smoothed
using a low-pass filter with a cutoff frequency of 0.6 km⁻¹.


[Figure 6.6 plot: altitude (km) versus sodium density (cm⁻³), September 9, 1987]


Figure 6.6. Sodium density profile above Nordlysstasjonen, Svalbard at midnight September 9,
1987. The integration period was 40 min, and the data were spatially smoothed
using a low-pass filter with a cutoff frequency of 0.6 km⁻¹.




Table 6.2. Sodium Layer Parameters Measured at Nordlysstasjonen (78°12'N,
15°15'E), Svalbard During the July and
September 1987 Campaigns

Observation Period          Peak Density      Column Abundance   Centroid      RMS
                            (cm⁻³)            (cm⁻²)             Height (km)   Thickness (km)
2159 UT July 10 -           550 ± 50          (5.7 ± 2)×10⁸      90.6 ± 2      3.82 ± 2
0100 UT July 15, 1987
2152-2206 UT                (4.3 ± 0.2)×10⁴   (1.4 ± 0.1)×10¹⁰   94.6 ± 0.2    2.52 ± 0.3
September 7, 1987
0000-0400 UT                (7.8 ± 0.4)×10³   (4.8 ± 0.1)×10⁹    93.5 ± 0.1    3.23 ± 0.1
September 9, 1987



abundance (~4.8×10⁹ cm⁻²) is comparable to mid-latitude values that have been measured in the
Northern Hemisphere in early September [Gardner et al., 1986].

The chemistry of the Na layer is still poorly understood even though numerous models
have been proposed in an attempt to explain the general features of the seasonal, diurnal and
geographical variations in the layer. Models for the dominant loss mechanisms have generally
evolved in two directions. One group of models has been developed by assuming neutral
reactions dominate Na chemistry [Kirchhoff, 1986; Thomas et al., 1983]. The other group
assumes that ionic species, especially cluster ions of the form Na⁺, are of considerable
importance [Richter and Sechrist, 1979; Jegou et al., 1985b]. A complete understanding of the
Na layer chemistry has been hampered by the lack of reliable estimates of the rate coefficients for
many of the important reactions. Even so, it is generally agreed that the seasonal variation in
mesopause temperature is primarily responsible for the seasonal variation of Na abundance at
mid-latitudes.

Swider [1985] recently reported model calculations which suggest that the neutral
reaction Na + O₂ + M → NaO₂ + M is an important sink for mesospheric Na. The Na loss rate
due to this reaction has a T-* temperature dependence. Von Zahn and Neuber [1987] report that
the winter mean temperature between 80 and 100 km at Andoya (69°N) ranges from ~200 to
220 K. In early August 1982, Philbrick et al. [1984] measured temperatures for the same
altitude region at Kiruna, Sweden (68°N). Their values were in the range from ~100 to 140 K.
Thus the Na loss rate at high latitudes for the neutral reaction discussed by Swider could be as
much as 10 times larger in summer compared to winter. However, because the July Na
abundance at Nordlysstasjonen (78°N) is almost 24 times smaller than the September 7 levels and
almost 9 times smaller than the September 9 levels, this process does not appear to be completely
responsible for the observed summertime depletion above Svalbard. Other loss mechanisms
which may play significant roles in Na depletion at high latitudes include absorption by polar
mesospheric cloud particles, absorption on smoke or dust particles and photo-ionization.
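
The depletion factors quoted in this discussion follow directly from the column abundances listed in Table 6.2. The short sketch below simply recomputes those ratios from the tabulated central values, ignoring the stated uncertainties; the variable names are illustrative.

```python
# Column abundances (cm^-2) from Table 6.2, central values only.
column_abundance = {
    "July 10-15": 5.7e8,
    "September 7": 1.4e10,
    "September 9": 4.8e9,
}

july = column_abundance["July 10-15"]
ratio_sep7 = column_abundance["September 7"] / july
ratio_sep9 = column_abundance["September 9"] / july

print(f"September 7 / July: {ratio_sep7:.1f}")
print(f"September 9 / July: {ratio_sep9:.1f}")
```

The ratios come out near 25 and 8, in line with the factors of roughly 24 and 9 quoted in the text once the uncertainty of the July value is taken into account.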




To summarize, the Na abundance above Nordlysstasjonen, Svalbard during July 10-15,
1987 was observed to be almost 5 times lower than typical summertime values measured at
mid-latitudes and from 9 to 24 times lower than values measured at Nordlysstasjonen during
September 7-11, 1987. Although the very cold temperatures near the summertime Arctic
mesopause will increase the effectiveness of the dominant chemical loss process for Na, this
mechanism does not appear to be strong enough to explain the large depletion.

6.3 Lidar Observations at Svalbard in November, 1987 and January, 1988

During the November 1987 campaign, the longest observation was made for 57 hours
starting at 1500 LST on November 7. The temporal variations and temporal power spectra of the
Na column abundance, centroid height, and rms width measured during this observation are
plotted in Figures 6.7 through 6.9. The periods of the most dominant variations in the layer
parameters were approximately 24 hours and 8 hours. The 8-hour period oscillations were also
dominant in the vertical wind velocity estimated at the altitude of 98 km plotted in Figure 6.10.
The technique for estimating the vertical wind velocity was described in detail by Kwon et al.
[1987].

During the January 1988 campaign, the longest observation was made for 72 hours
starting at local midnight on January 10. Figures 6.11 through 6.13 show the temporal
variations and temporal power spectra of the column abundance, centroid height, and rms width
measured during this observation. The periods of the dominant oscillations in the abundance and
rms width are 38, 18, and 10 hours, and those in the centroid height are 22, 13, and 8 hours.
The 8-hour period oscillations were also dominant in the temporal variations of the vertical wind
velocity estimated at the altitude of 98 km as plotted in Figure 6.14. It appears that the Na layer
observed in November, 1987 and January, 1988 was dominated by waves with periods of
approximately 24 hours and 8 hours.
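
The dominant periods reported here are read from temporal power spectra of the layer parameters. The following sketch shows the general idea on synthetic data only: it builds a hypothetical 72-hour centroid-height series containing 24-hour and 8-hour oscillations and locates the two strongest peaks of a simple periodogram. The sampling interval, amplitudes, and function name are assumptions made for illustration and do not reproduce the processing used for Figures 6.7 through 6.14.

```python
import cmath
import math

def periodogram(samples, dt_hours):
    """Return (period_hours, power) pairs from a direct discrete Fourier transform."""
    n = len(samples)
    mean = sum(samples) / n
    centered = [x - mean for x in samples]          # remove the mean (zero-frequency term)
    spectrum = []
    for k in range(1, n // 2 + 1):                  # positive frequencies only
        coeff = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                    for i, x in enumerate(centered))
        period = n * dt_hours / k                   # period of the k-th harmonic
        spectrum.append((period, abs(coeff) ** 2 / n))
    return spectrum

# Synthetic record: 72 hours sampled every 0.5 hour (hypothetical values).
dt = 0.5
n_samples = 144
series = [90.0
          + 3.0 * math.cos(2 * math.pi * i * dt / 24.0)        # 24-hour oscillation
          + 1.5 * math.cos(2 * math.pi * i * dt / 8.0 + 0.7)   # 8-hour oscillation
          for i in range(n_samples)]

spectrum = periodogram(series, dt)
top_two = sorted(spectrum, key=lambda pair: pair[1], reverse=True)[:2]
dominant_periods = sorted(round(period) for period, _ in top_two)
print(dominant_periods)  # [8, 24]
```

With a record only 57 or 72 hours long, the frequency resolution is coarse, so periods near 24 hours can only be identified approximately.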


[Figure 6.7 plot: b) horizontal axis, frequency (1/min)]

Figure 6.7. a) Temporal variations and b) temporal power spectrum of the Na column
abundance measured during the 57-hour period starting at 1500 LST on November
7, 1987.


[Figure 6.8 plots: a) and b), November 7-9, 1987; horizontal axis of a), local standard time]

Figure 6.8. a) Temporal variations and b) temporal power spectrum of the Na layer centroid
height measured during the 57-hour period starting at 1500 LST on November 7,
1987.


[Figure 6.9 plot: b) horizontal axis, frequency (1/min)]

Figure 6.9. a) Temporal variations and b) temporal power spectrum of the Na rms width
measured during the 57-hour period starting at 1500 LST on November 7, 1987.


[Figure 6.10 plots: a) and b), November 7-10, 1987; horizontal axis of b), frequency (1/min)]

Figure 6.10. a) Temporal variations and b) temporal power spectrum of the vertical winds at the
altitude of 98 km. The vertical winds were inferred from the temporal variations of
the Na density gradients on the layer topside measured during the 57-hour period
starting at 1500 LST on November 7, 1987.


[Figure 6.11 plot: a) horizontal axis, local standard time (hours)]

Figure 6.11. a) Temporal variations and b) temporal power spectrum of the Na column
abundance measured during the 72-hour period starting at local midnight on
January 10, 1988.


[Figure 6.12 plot: b) horizontal axis, frequency (1/min)]

Figure 6.12. a) Temporal variations and b) temporal power spectrum of the Na centroid
height measured during the 72-hour period starting at local midnight on
January 10, 1988.


[Figure 6.13 plot: b) horizontal axis, frequency (1/min)]

Figure 6.13. a) Temporal variations and b) temporal power spectrum of the Na rms width
measured during the 72-hour period starting at local midnight on January 10,
1988.


[Figure 6.14 plots: a) and b), January 10-13, 1988]

Figure 6.14. a) Temporal variations and b) temporal power spectrum of the vertical winds at the
altitude of 98 km. The vertical winds were inferred from the temporal variations of
the Na density gradients on the layer topside measured during the 72-hour period
starting at local midnight on January 10, 1988.




The characteristics of the sporadic Na layers observed at Svalbard are very similar to
those observed at Andoya, Norway by von Zahn and Hansen [1988] and at Hawaii by Kwon et
al. [1988]. To illustrate, sporadic Na layers observed during the November 1987 and January
1988 campaigns are plotted in Figure 6.15. The peak densities and thicknesses of these sporadic
Na layers are quite comparable to those observed at Andoya and Hawaii. The sporadic Na layer
observed on January 10 developed at the altitude of about 85 km, which appears to be the lowest
altitude of the sporadic Na layers observed with the UIUC lidar. Another sporadic Na layer
observed during the November 1987 campaign is plotted in Figure 6.16. The temporal evolution
of the bifurcation of this sporadic Na layer is plotted in Figure 6.16b.




7.  CONCLUSIONS  AND  RECOMMENDATIONS 


7.1  Conclusions 

A  data  analysis  technique  for  determining  gravity  wave  intrinsic  parameters  is  described 
in  Chapter  2.  The  intrinsic  parameters  include  the  horizontal  and  vertical  wavelengths,  period, 
and  horizontal  propagation  direction.  The  technique  involves  measuring  the  altitude  variations  of 
the  wave  induced  density  perturbations  in  the  Na  layer.  This  technique  can  be  used  with  airborne 
lidars,  multiple  ground-based  lidars,  and  steerable  lidars.  Several  examples  have  been  presented 
to  show  the  expected  measurement  errors  for  the  wave  parameters.  The  technique  is  applied  to 
the  airborne  Na  lidar  data  obtained  during  the  westward  flight  conducted  in  November  1986. 
During  the  flight,  strong  wave  perturbations  were  observed  in  the  Na  layer  near  the  Pacific  Coast 
over  a  horizontal  distance  of  nearly  700  km.  The  intrinsic  horizontal  wavelength  of  this  wave 
was  estimated  to  be  about  85  km,  and  the  vertical  wavelength  was  4.1  km.  The  intrinsic  period 
was  about  1.7  hours,  and  the  propagation  direction  was  almost  due  south. 

Kinetic  energy  horizontal  and  vertical  wavenumber  spectra  of  horizontal  winds  are 
presented  in  Chapter  3.  The  spectra  have  been  inferred  from  the  airborne  lidar  data  collected 
during  the  eastward  and  westward  flights.  The  average  slope  of  the  horizontal  wavenumber 
spectra  was  -1.25  at  horizontal  scales  ranging  from  70  to  700  km,  and  the  average  slope  of  the 
vertical  wavenumber  spectra  was  -2.67  at  vertical  scales  from  2  to  10  km.  The  altitude  range  of 
the measurements was approximately 82 to 101 km. The slopes of the horizontal wavenumber
spectra  computed  only  for  the  bottomside  (82-90  km)  of  the  Na  layer  were  consistently  steeper 
than  those  computed  for  the  topside  (90-101  km).  The  average  slope  of  the  bottomside 
horizontal  wavenumber  spectra  was  -1.51,  and  that  of  the  topside  spectra  was  -0.97. 

Internal  gravity  waves  appear  to  be  responsible  for  major  features  of  the  airborne  Na  lidar 
data.  The  observed  features  include  the  systematic  horizontal  and  vertical  variations  of  the  Na 
density  profiles,  the  horizontal  variations  of  the  centroid  height,  the  presence  of  distinct  spectral 




peaks  in  the  horizontal  wavenumber  spectra,  and  Doppler-shifting  of  these  spectral  peaks.  It  is 
difficult  to  interpret  these  observed  features  in  terms  of  turbulence.  However,  it  seems  possible 
that  the  dominant  density  perturbations  observed  in  the  lidar  data  at  horizontal  scales  in  the  range 
from  20  to  2000  km  are  due  to  gravity  waves. 

The slopes of the vertical wavenumber spectra agree well with the slopes calculated from
radar observations. The slopes of the horizontal wavenumber spectra of the airborne lidar data
are shallower than the slopes of the GASP and shuttle re-entry spectra. In general, the rms
horizontal wind velocities measured with the airborne lidar increased with time and with
longitude from the Pacific Coast to the Great Plains. These rms horizontal wind velocities were
comparable with those measured with ground-based lidars in Hawaii, Illinois, and Maryland.
This appears to indicate that the magnitudes of the horizontal winds over the Pacific Ocean are
comparable with those over the land masses.

During the eastward flight, two quasi-monochromatic waves were observed. One wave
had a much longer zonal wavelength than the other. The parameters of these waves were computed
by using the Doppler-shifted zonal wavelengths measured during the eastbound and westbound
flight legs. The longer wavelength wave had an intrinsic zonal wavelength of 772 km, intrinsic
zonal phase velocity of 35 m s^-1 westward, and intrinsic period of 6.1 hours. This wave appears
to be propagating almost due west, and is believed to be an atmospheric tide. The shorter
wavelength wave had an intrinsic zonal wavelength of 263 km, zonal phase velocity of 43 m s^-1
westward, and period of 1.7 hours. This wave also appears to be propagating westward.
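
The quoted periods follow from the quoted wavelengths and phase velocities through the relation period = wavelength / phase velocity. The short Python sketch below reproduces that arithmetic for both waves. The function name `wave_period_hours` is illustrative only and does not come from the thesis.

```python
def wave_period_hours(wavelength_km, phase_speed_m_s):
    """Period T = wavelength / phase speed, returned in hours."""
    wavelength_m = wavelength_km * 1000.0
    period_s = wavelength_m / phase_speed_m_s
    return period_s / 3600.0


# Values quoted in the text for the two quasi-monochromatic waves.
long_wave = wave_period_hours(772.0, 35.0)
short_wave = wave_period_hours(263.0, 43.0)
print(f"{long_wave:.1f} h, {short_wave:.1f} h")  # prints "6.1 h, 1.7 h"
```

Both results agree with the 6.1-hour and 1.7-hour periods stated above.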

In Chapter 4, the results of a joint lidar/radar campaign conducted in November 1986 are
presented. The lidar campaign included the airborne observations and ground-based
observations at Broomfield and Denver, Colorado. The radar observations were obtained at
Platteville, Colorado with the ST radar. The radar observations showed a dominant variation in
the horizontal wind perturbations with a period of 6 hours. The ground-based and airborne lidar
observations also showed dominant Na density variations caused by waves with periods of 6 hours.
These 6-hour period variations observed with the radar, ground-based lidar, and airborne lidar
were most dominant at the altitudes which correspond to the bottomside of the layer. It appears
that the 6-hour period waves observed with the ground-based and airborne lidars are responsible
for the horizontal wind variation with a period of 6 hours observed with the radar. The data
obtained with both the ground-based and airborne lidars also exhibited dominant Na density
variations caused by waves with periods of approximately 2 hours. These 2-hour period waves were
also dominant only on the bottomside of the layer.

The characteristics of sporadic Na layers observed above Mauna Kea, Hawaii (20°N,
155°W) are described in Chapter 5. These layers were observed on a total of 16 occasions
during 30 hours of lidar measurements from January 17 to 22, 1987. The most prominent
sporadic layer, which formed on the night of January 21, exhibited a peak density of 2.8 x 10^4
cm^-3 near 96 km with a full width of about 2 km. The rapid growth and decay of the layer lasted
for about 40 min. The apparent Na production and decay rates of this layer were approximately
60 cm^-3 s^-1. Even after the decay, the narrow layer was observed continuously for almost 8
hours. During this period the layer moved downward steadily with a mean apparent velocity of
31 cm s^-1. The most significant characteristic of the 16 sporadic layers appears to be their
occurrence times. The layers formed either in the late evening between 2100 and 2330 LST or in
the early morning between 0300 and 0600 LST. The early morning layers began forming within a
short time span of 15 min, from 0253 to 0308 LST, on three different days. The mean time of the
maximum peak density of the early morning layers occurred about 6 hours after that of the late
evening layers. The mechanisms responsible for creating these layers appear to be related to
diurnal tides and sporadic E layers.

From July 1987 to April 1988, the UIUC group conducted a total of five Na lidar
campaigns at Nordlysstasjonen, Svalbard (78°N, 16°E), Norway. The characteristics of the Na
layer observed at Svalbard are described in Chapter 6. The seasonal variations of the abundance
measured at Svalbard show an annual oscillation with a broad maximum in January to April
followed by a distinct minimum in June. The column abundance measured in June was ~6 x 10^8
cm^-2, a value that is almost 5 times lower than typical summertime Na abundances measured at
mid-latitudes. The centroid height measured at Svalbard shows semi-annual oscillations with
maxima near equinox in March and September and minima near solstice in December and June.
These semi-annual oscillations are quite similar to those observed at Urbana. During the winter
months, the centroid height observed at Svalbard was generally lower than the centroid height
observed at Urbana. This could be the result of a stronger downward motion of the atmosphere
over the high latitude site of Svalbard during the winter months. The rms width measured at
Svalbard shows an annual oscillation with a maximum near February and a minimum near September.

The characteristics of sporadic Na layers observed at Svalbard are quite similar to those observed
at Hawaii (Chapter 5).

7.2  Recommendations  for  Future  Work 

The  data  analysis  technique  described  in  Chapter  2  can  be  applied  to  airborne,  multiple 
ground-based,  and  steerable  lidar  experiments.  In  June  1981,  steerable  lidar  experiments  were 
conducted  at  the  Goddard  Space  Flight  Center  in  Maryland.  The  new  data  analysis  technique  can 
be  applied  to  these  data  in  order  to  estimate  gravity  wave  parameters.  Additional  steerable 
measurements  with  either  the  Candela  or  the  CEDAR  lidars  are  also  recommended.  Joint 
observations  of  a  steerable  lidar  and  an  MST  radar  will  be  quite  valuable,  because  the  radar  can 
measure  the  background  winds,  and  the  steerable  lidar  can  measure  the  propagation  directions  of 
gravity  waves.  Comparison  of  these  data  can  help  determine  wave  propagation  characteristics  at 
the altitudes of the Na layer.

The airborne campaign conducted in November 1986 was very successful.
The flight paths were selected to investigate longitudinal characteristics of the Na layer and
gravity waves. Additional airborne campaigns are recommended to investigate the latitudinal
characteristics of the layer and gravity waves. Triangular flight patterns are also recommended to
measure the wave propagation directions.

Dominant waves with periods of approximately 2 hours have been observed often with
the Na lidars at Urbana, Illinois (40°N) [Gardner et al., 1986; Gardner and Voelz, 1987;
Gardner, 1989], at Broomfield, Colorado (40°N) [Kwon et al., 1989b], during the eastward
flight on November 15-16, 1986 [Kwon et al., 1989a], and during the westward flight on
November 17-18 [Kwon et al., 1989c]. It appears that these 2-hour period waves often
dominate the bottomside (80 - 90 km) of the Na layer at mid-latitudes over North America.
Studies of the 2-hour period waves observed at Urbana are recommended to characterize the
waves.

The mechanisms responsible for creating and dissipating the sporadic Na layers are not
yet well understood. The sporadic Na layers have been observed often at the low latitude site of
Hawaii [Kwon et al., 1988] and the high latitude site of Svalbard (Chapter 6), but very rarely at
the mid-latitude site of Urbana [Senft et al., 1989]. In November 1987 and January 1988,
simultaneous observations of lidar and airglow were conducted at Svalbard. The airglow
observations were made at the wavelengths of Na, OH(6-2), and Oxygen atmospheric (0-1)
bands. Increases in the intensity of Na and OH airglow were observed almost
simultaneously with the appearances of sporadic Na layers [private communication with Roger
Smith, University of Alaska]. Other recent observations of sporadic Na layers at Arecibo
(18°N, 67°W) in January 1989 have revealed that the altitudes of sporadic Na layers coincided
with the altitudes of sporadic E layers. These observations at Svalbard and Arecibo are
unique and should help determine the mechanisms responsible for creating and dissipating the
sporadic Na layers.

In January and March 1986, several daytime observations were obtained at Urbana.
More daytime observations at Urbana are strongly recommended to characterize the mean
diurnal variations of the Na layer and to study the climatology of tidal waves.




APPENDIX  I 


RMS  ERRORS  IN  GRAVITY  WAVE  INTRINSIC  PARAMETERS 
FOR AIRBORNE LIDAR OBSERVATIONS OVER A
CIRCULAR  FLIGHT  PATH 

Consider an aircraft flying over a circular path of radius $R$ during a total observation
period of $T_{ob}$. The ground track of the flight is illustrated in Figure 2.2. Assume that a total of $n$
measurements equally spaced along the flight path is obtained. Let the reference time be $t_0 =
T_{ob}/2$, the reference horizontal position be $(x_0, y_0) = (0, 0)$, and the reference altitude be $z_0$. The
$i$-th measurement referenced to $(t_0, x_0, y_0, z_0)$ is

$$\Delta t_i = t_i - t_0 = \frac{i}{n} T_{ob} - \frac{T_{ob}}{2} \tag{I.1}$$

$$\Delta x_i = x_i - x_0 = R \cos\left(\frac{2\pi i}{n}\right) \tag{I.2}$$

$$\Delta y_i = y_i - y_0 = R \sin\left(\frac{2\pi i}{n}\right) \tag{I.3}$$

$$\Delta z_i = z_i - z_0 \tag{I.4}$$

where $t_i$, $x_i$, $y_i$, and $z_i$ correspond to the time, x-coordinate, y-coordinate, and altitude of the $i$-th
measurement, and $1 \le i \le n$. These measurements are substituted into Equation (2.3), and the
resulting system of equations is written in matrix form.

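$$Z = U A \tag{I.5}$$

where

$$Z = [\, \Delta z_1 \;\; \Delta z_2 \;\; \cdots \;\; \Delta z_n \,]^T \tag{I.6}$$

$$U = \begin{bmatrix}
\Delta t_1 & \Delta x_1 & \Delta y_1 & 1 \\
\Delta t_2 & \Delta x_2 & \Delta y_2 & 1 \\
\vdots & \vdots & \vdots & \vdots \\
\Delta t_n & \Delta x_n & \Delta y_n & 1
\end{bmatrix} \tag{I.7}$$

$$A = [\, a_1 \;\; a_2 \;\; a_3 \;\; a_4 \,]^T \tag{I.8}$$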

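In order to estimate the errors in the gravity wave parameters, the $U^T U$ matrix and the coefficient
covariance matrix have to be calculated.

$$U^T U = \begin{bmatrix}
\sum \Delta t_i^2 & \sum \Delta t_i \Delta x_i & \sum \Delta t_i \Delta y_i & \sum \Delta t_i \\
\sum \Delta t_i \Delta x_i & \sum \Delta x_i^2 & \sum \Delta x_i \Delta y_i & \sum \Delta x_i \\
\sum \Delta t_i \Delta y_i & \sum \Delta x_i \Delta y_i & \sum \Delta y_i^2 & \sum \Delta y_i \\
\sum \Delta t_i & \sum \Delta x_i & \sum \Delta y_i & n
\end{bmatrix} \tag{I.9}$$

where the summations are over $1 \le i \le n$. In this case,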


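[Equation (I.10), the evaluated $U^T U$ matrix for the circular flight path, is not legible in this
copy. The surviving fragments include a zero entry and what appears to be a term
$n R T_{ob} / (2\pi)$.]

The coefficient covariance matrix is given as $C = [U^T U]^{-1}\,\mathrm{var}(z)$.

[Equations (I.11) through (I.13), which give the covariance matrix and the resulting rms errors
in the gravity wave parameters, are not legible in this copy. The surviving fragments show
factors of std(z) and $\sqrt{2/n}$.]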
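
As a numerical cross-check of the normal matrix for the circular flight path, the sketch below builds $U^T U$ by direct summation. It assumes the sample times are $t_i = i\,T_{ob}/n$, which is one reading of the equally spaced measurements described above. The function name and the numerical values are illustrative only, and the closed forms used for comparison come from summing the trigonometric series, not from the thesis.

```python
import math


def circular_path_normal_matrix(n, radius, t_ob):
    """Build U^T U for n samples equally spaced around a circle.

    Each row of U is (dt_i, dx_i, dy_i, 1), with dt_i = (i/n - 1/2) * t_ob,
    dx_i = radius * cos(2*pi*i/n) and dy_i = radius * sin(2*pi*i/n).
    """
    rows = []
    for i in range(1, n + 1):
        angle = 2.0 * math.pi * i / n
        rows.append(((i / n - 0.5) * t_ob,
                     radius * math.cos(angle),
                     radius * math.sin(angle),
                     1.0))
    # Entry (a, b) is the sum over samples of column a times column b.
    return [[sum(r[a] * r[b] for r in rows) for b in range(4)]
            for a in range(4)]


n, radius, t_ob = 360, 100.0, 4 * 3600.0  # illustrative values only
utu = circular_path_normal_matrix(n, radius, t_ob)

# Sum of dx^2 compared with n R^2 / 2.
print(utu[1][1], n * radius**2 / 2)
# Sum of dt*dy compared with -(T_ob R / 2) cot(pi/n), about -n R T_ob / (2 pi).
print(utu[0][2], -0.5 * t_ob * radius / math.tan(math.pi / n))
```

Under these assumptions the sums of $\Delta x_i$, $\Delta y_i$, and $\Delta x_i \Delta y_i$ vanish, so the spatial coefficients decouple from the constant term in the least-squares fit.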


APPENDIX II

RMS  ERRORS  IN  GRAVITY  WAVE  INTRINSIC  PARAMETERS 
FOR  MULTIPLE  GROUND-BASED  LIDAR  OBSERVATIONS 


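Consider a configuration of three ground-based lidars located at the corners of an
equilateral triangle with sides of length $R$ as illustrated in Figure 2.3. The horizontal coordinates
of the three lidar sites are

$$\text{Site A:}\quad (x_a, y_a) = \left(0,\ \frac{R}{\sqrt{3}}\right) \tag{II.1}$$

$$\text{Site B:}\quad (x_b, y_b) = \left(\frac{-R}{2},\ \frac{-R}{2\sqrt{3}}\right) \tag{II.2}$$

$$\text{Site C:}\quad (x_c, y_c) = \left(\frac{R}{2},\ \frac{-R}{2\sqrt{3}}\right) \tag{II.3}$$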
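
The geometry of such a three-site layout can be checked with a few lines of Python. The sketch below assumes the triangle is centered on the reference position (0, 0), so that each site lies a distance $R/\sqrt{3}$ from the origin. That placement, the function name `triangle_sites`, and the 100 km side length are assumptions made for illustration.

```python
import math


def triangle_sites(side):
    """Vertices of an equilateral triangle with its centroid at the origin."""
    h = side / math.sqrt(3.0)  # distance from the centroid to each vertex
    return {
        "A": (0.0, h),
        "B": (-side / 2.0, -h / 2.0),
        "C": (side / 2.0, -h / 2.0),
    }


sites = triangle_sites(100.0)  # 100 km side, an illustrative value
for p, q in (("A", "B"), ("B", "C"), ("C", "A")):
    # Every pair of sites should be separated by the side length.
    print(p, q, math.dist(sites[p], sites[q]))
```

With this placement the site coordinates sum to zero in both x and y, which is what makes (0, 0) a natural reference position.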

Assume that a total of $n$ measurements are obtained simultaneously at each of the three lidar sites
over a total observation period of $T_{ob}$. Let the reference time be $t_0 = T_{ob}/2$, the reference
horizontal position be $(x_0, y_0) = (0, 0)$, and the reference altitude be $z_0$. The differences between
the observations at site A and the reference data point are

$$\Delta t_{ai} = t_{ai} - t_0 \tag{II.4}$$

$$\Delta x_{ai} = x_a \tag{II.5}$$

$$\Delta y_{ai} = y_a \tag{II.6}$$

$$\Delta z_{ai} = z_{ai} - z_0 \tag{II.7}$$

where $z_{ai}$ is the observation of the altitude of a density maximum or minimum made at time $t_{ai}$
at site A. Data from sites B and C are written using similar notations. These data are substituted
into Equation (2.3), and the resulting system of equations is written in matrix form.

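$$Z = U A \tag{II.8}$$

where

$$Z = [\, \Delta z_{a1} \cdots \Delta z_{an} \;\; \Delta z_{b1} \cdots \Delta z_{bn} \;\; \Delta z_{c1} \cdots \Delta z_{cn} \,]^T \tag{II.9}$$

$$U = \begin{bmatrix}
\Delta t_{a1} & \Delta x_{a1} & \Delta y_{a1} & 1 \\
\vdots & \vdots & \vdots & \vdots \\
\Delta t_{an} & \Delta x_{an} & \Delta y_{an} & 1 \\
\Delta t_{b1} & \Delta x_{b1} & \Delta y_{b1} & 1 \\
\vdots & \vdots & \vdots & \vdots \\
\Delta t_{bn} & \Delta x_{bn} & \Delta y_{bn} & 1 \\
\Delta t_{c1} & \Delta x_{c1} & \Delta y_{c1} & 1 \\
\vdots & \vdots & \vdots & \vdots \\
\Delta t_{cn} & \Delta x_{cn} & \Delta y_{cn} & 1
\end{bmatrix} \tag{II.10}$$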


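In order to estimate the errors in the gravity wave parameters, the $U^T U$ matrix and the coefficient
covariance matrix have to be calculated.

[Equation (II.11), the $U^T U$ matrix, is not legible in this copy.]

And the coefficient covariance matrix is

[Equation (II.12) is not legible in this copy.]

By using Equations (2.18) and (2.19) the rms errors in the gravity wave parameters are given as

[Equations (II.13) and (II.14) are only partly legible in this copy. The surviving fragments show
rms errors proportional to std(z)/R, with (II.14) also containing a factor of $1/\sqrt{n}$.]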




APPENDIX III


CHARACTERISTICS OF SPORADIC SODIUM LAYERS OBSERVED
AT MAUNA KEA OBSERVATORY, HAWAII


Sporadic           Time Period     Duration       Beginning and      Sporadic
Layer    Date      (LST)           (hour:min)     Ending Altitudes   E Layers
Number                                            (km)

1        Jan 21    2211-0230       4:19           95.40, 91.50       Yes
2        Jan 21    2217-2223       0:06           97.80, 96.90       Yes
3        Jan 21    2226-2235       0:09           94.50, 93.00       Yes
4        Jan 21    2200-2213       0:13           98.25, 96.45       Yes
5        Jan 22    0253-0600a      3:07b          91.50, 87.30       No
6        Jan 19    2114c-2117a     0:03b          96.60, 96.60       Yes
7        Jan 20    0256-0621a      3:25b          96.45, 93.15       No
8        Jan 21    2112-2131       0:19           94.35, 94.20       No
9        Jan 21    2226-2326       1:00           98.40, 100.05      Yes
10       Jan 20    2253-2354       1:01           96.45, 93.45       No
11       Jan 18    0308-0453       1:45           92.55, 93.75       No
12       Jan 21    2251-2330a      0:39b          103.35, 102.00     Yes
13       Jan 20    2052c-2231      1:39b          92.25, 92.55       No
14       Jan 20    0305-0356       0:51           94.50, 94.05       No
15       Jan 21    2226-2248       0:22           100.80, 101.25     Yes
16       Jan 21    2140-2200       0:20           104.40, 102.15     No

a Sporadic layer present when data collection stopped.

b Measurement of the duration was impossible because the sporadic layer had already formed
when data collection began or was present when data collection stopped.

c Sporadic layer already formed when data collection began.
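
The Duration column is the elapsed time between the two LST clock times in the Time Period column, with periods that run past midnight counted into the next day. A minimal Python sketch of that arithmetic follows. The function name `lst_duration` is illustrative and not from the thesis.

```python
def lst_duration(start_hhmm, end_hhmm):
    """Return (hours, minutes) between two LST clock times given as 'HHMM'.

    An end time earlier than the start time is taken to fall on the next day.
    """
    start = int(start_hhmm[:2]) * 60 + int(start_hhmm[2:])
    end = int(end_hhmm[:2]) * 60 + int(end_hhmm[2:])
    minutes = (end - start) % (24 * 60)
    return divmod(minutes, 60)


print(lst_duration("2211", "0230"))  # layer 1, prints (4, 19)
print(lst_duration("0253", "0600"))  # layer 5, prints (3, 7)
```

For layers whose start or end was cut off by the limits of data collection, the value computed this way is only a lower bound on the true duration.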




Sporadic Strength  Maximum     Time of     Altitude of Sporadic Layer
Layer    Factor    Density     Maximum     Maximum     Abundance
Number             at Peak     Density     Density     (cm^-2)
                   (cm^-3)     (LST)       (km)

1        13.8      27890       2224        95.70       6.7 x 10^9
2        14.1      17680       2218        97.50       2.0 x 10^9
3        5.1       12800       2226        94.50       1.5 x 10^9
4        6.2       9640        2211        97.05       1.3 x 10^9
5        6.8       9280        0407        89.85       1.9 x 10^9
6        NA        6600        NA          NA          7.0 x 10^8
7        4.6       5870        0451        93.45       1.9 x 10^9
8        1.6       4760        2116        94.20       7.2 x 10^8
9        4.3       4180        2226        98.40       3.7 x 10^8
10       2.0       4160        2317        95.40       9.6 x 10^8
11       1.9       3400        0312        92.85       6.5 x 10^8
12       16.3      3390        2321        101.70      5.0 x 10^8
13       2.0       3030        2101        92.40       6.9 x 10^8
14       1.7       2860        0345        94.20       4.2 x 10^8
15       15.4      2350        2229        100.50      2.4 x 10^8
16       27.5      1670        2158        102.15      3.2 x 10^8

NA, not available.




Sporadic Average Na      Formation  Average Na    Decay    Mean Vertical
Layer    Production Rate Period     Decay Rate    Period   Velocity of Layer
Number   (cm^-3 s^-1)    (min)      (cm^-3 s^-1)  (min)    Peak (cm s^-1)

1        28.4            12.8       12.64         27.5     -28.5
2        92.7            1.8        22.3          3.7      -274
3        NA              NA         17.3          9.2      -293
4        8.06            11.0       35.4          1.8      -205
5        1.03            73.3       2.18          60.5     -35.1
6        NA              NA         NA            NA       NA
7        2.71            18.3       0.52          55.0     -33.2
8        2.62            3.7        1.17          14.7     3.30
9        10.9            1.8        1.52          34.8     155
                                                           -35.9
10       2.64            7.3        1.95          16.5     64.9
11       2.03            3.7        0.33          82.5     12.4
12       5.15            9.2        NA            NA       -184
                                                           -22.8
13       NA              NA         0.28          60.5     2.40
14       0.23            40.3       0.46          9.2      -14.7
15       1.75            3.7        1.86          12.8     25.8
16       3.23            3.7        4.52          1.8      -181



Sporadic Mean Full    Mean Full    Mean      Mean         Mean         Mean
Layer    Width at 80% Width        Top-half  Top-half     Bottom-half  Bottom-half
Number   (km)         Broadening   Width     Width        Width        Width
                      Rate         (km)      Broadening   (km)         Broadening
                      (cm s^-1)              Rate                      Rate
                                             (cm s^-1)                 (cm s^-1)

1        1.25         8.17         0.74      7.41         0.51         0.76
2        0.68         27.5         0.34      -13.7        0.34         41.1
3        0.45d        18.4         0.27d     25.8         0.18d        -7.37
4        0.88         11.5         0.35      6.57         0.53         4.93
5        1.21         20.1         0.64      11.6         0.57         8.46
6        0.60         NA           0.30      NA           0.30         NA
7        1.29         11.8         0.66      7.:o         0.64         4.55
8        0.56d        23.6         0.21d     9.89         0.34d        13.7
9        0.69         4.67         0.32      2.71         0.36         1.95
         0.65         -8.06        0.33      -9.44        0.33         1.39
10       1.54         11.7         0.82      12.1         0.72         -0.34
11       1.22         1.67         0.65      0.80         0.57         0.87
12       0.87         7.35         0.45      18.4         0.42         -11.1
         0.81         2.59         0.43      0.72         0.39         3.31
13       1.26         -0.33        0.71      -1.86        0.55         1.53
14       0.47d        1.19         0.27d     -0.72        0.20d        1.92
15       0.67         3.26         0.31      6.03         0.35         -2.77
16       0.90         -9.45        0.46      -20.8        0.44         11.4

d Width determined at 95% of the peak density.
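
In the width table above, the mean full width appears to equal the sum of the mean top-half and bottom-half widths to within the rounding of the tabulated values. The Python sketch below checks this for the first five layers. The helper name `width_residual` is illustrative, and the relationship itself is an observation about the tabulated numbers, not a statement made in the thesis.

```python
# (layer, full width, top-half width, bottom-half width), all in km,
# copied from the first five rows of the width table.
rows = [
    (1, 1.25, 0.74, 0.51),
    (2, 0.68, 0.34, 0.34),
    (3, 0.45, 0.27, 0.18),
    (4, 0.88, 0.35, 0.53),
    (5, 1.21, 0.64, 0.57),
]


def width_residual(full, top, bottom):
    """Difference between the full width and the sum of its two halves."""
    return full - (top + bottom)


for layer, full, top, bottom in rows:
    # A residual below half of the last tabulated digit counts as agreement.
    print(layer, abs(width_residual(full, top, bottom)) < 0.005)
```

The same comparison can be applied to the broadening-rate columns, where the tabulated values carry fewer significant digits and the agreement is correspondingly looser.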




REFERENCES 


Avery,  S.  K.,  and  D.  Tetenbaum,  Simultaneous  sodium  and  wind  measurements  in  the  upper 
mesosphere  using  the  Urbana  meteor  radar  and  lidar  systems,  J.  Atmos.  Terr.  Phys.,  45, 
753-764,  1983. 

Balsley, B. B., and D. A. Carter, The spectrum of atmospheric velocity fluctuations at 8 km and
86 km, Geophys. Res. Lett., 9, 465-468, 1982.

Balsley, B. B., and R. Garello, The kinetic energy density in the troposphere, stratosphere and
mesosphere: A preliminary study using the Poker Flat radar in Alaska, Radio Sci., 20,
1355-1362, 1985.

Batista, P. P., B. R. Clemesha, D. M. Simonich, and V. W. J. H. Kirchhoff, Tidal oscillations
in the atmosphere sodium layer, J. Geophys. Res., 90, 3881-3888, 1985.

Bernard, R., J. L. Fellous, M. Massebeuf, and M. Glass, Simultaneous meteor radar
observations of Monpazier (France, 44°N) and Punta Borinquen (Puerto Rico, 18°N), I,
Latitudinal variations of atmospheric tides, J. Atmos. Terr. Phys., 43, 525-533, 1981.

Beatty, T. J., R. E. Bills, K. H. Kwon, and C. S. Gardner, CEDAR lidar observations of
sporadic Na layers at Urbana, Illinois, Geophys. Res. Lett., 15, 1137-1140, 1988.

Blamont, J. E., and T. M. Donahue, The airglow of the sodium D lines, J. Geophys. Res., 66,
1407-1423, 1961.

Bowman, M. R., A. J. Gibson, and M. C. W. Sandford, Atmospheric sodium measured by a
tuned laser radar, Nature, 221, 456-457, 1969.

Burnett, C. R., R. W. Lasher, A. S. Miskin, and V. L. Sides, Spectroscopic measurement of
sodium airglow: Absence of a large diurnal variation, J. Geophys. Res., 80, 1837-1844,
1975.

Chiu, Y. T., and B. K. Ching, The response of atmospheric and lower ionospheric layer
structures to gravity waves, Geophys. Res. Lett., 5, 539-542, 1978.

Clemesha, B. R., V. W. J. H. Kirchhoff, D. M. Simonich, and H. Takahashi, Evidence of an
extraterrestrial source for the mesospheric sodium layer, Geophys. Res. Lett., 5, 873-876,
1978.

Clemesha, B. R., V. W. J. H. Kirchhoff, D. M. Simonich, H. Takahashi, and B. P. Batista,
Spaced lidar and nightglow observations of an atmospheric sodium enhancement, J.
Geophys. Res., 85, 3480-3484, 1980.

Clemesha, B. R., V. W. J. H. Kirchhoff, D. M. Simonich, and P. P. Batista, Mesospheric
winds from lidar observations of atmospheric sodium, J. Geophys. Res., 86, 868-870,
1981.

Donahue, T. M., and R. R. Meier, Distribution of sodium in the daytime upper atmosphere as
measured by a rocket experiment, J. Geophys. Res., 72, 2803-2829, 1967.




Fricke, K. H., and U. von Zahn, Mesopause temperature derived from probing the hyperfine
structure of the D2 resonance line of sodium by lidar, J. Atmos. Terr. Phys., 47, 499-512,
1985.

Fritts, D. C., Gravity wave saturation in the middle atmosphere: A review of theory and
observations, Rev. Geophys., 22, 275-308, 1984.

Fritts, D. C., M. A. Geller, B. B. Balsley, M. L. Chanin, I. Hirota, J. R. Holton, S. Kato, R.
S. Lindzen, M. R. Schoeberl, R. A. Vincent, and R. F. Woodman, Research status and
recommendations from the Alaska workshop on gravity waves and turbulence in the middle
atmosphere, Bull. Am. Meteorol. Soc., 65, 149-159, 1984.

Fritts, D. C., R. C. Blanchard, and L. Coy, Gravity wave structure between 60 and 90 km
inferred from space shuttle re-entry data, J. Atmos. Sci., in press, 1989.

Gadsen, M., and C. M. Purdy, Observations of the sodium dayglow, Ann. Geophys., 26,
43-51, 1970.

Gage, K. S., and G. D. Nastrom, On the spectrum of atmosphere velocity fluctuations seen by
MST/ST radar and their interpretation, Radio Sci., 20, 1339-1347, 1985.

Gage, K. S., and G. D. Nastrom, Theoretical interpretation of atmospheric wavenumber spectra
of wind and temperature observed by commercial aircraft during GASP, J. Atmos. Sci., 43,
729-740, 1986.

Gardner, C. S., C. F. Sechrist, Jr., and J. L. Bufton, Steerable lidar studies of the mesospheric
sodium layer structure, Proc. 11th Int. Laser Radar Conf., Madison, WI, 1982.

Gardner, C. S., and J. D. Shelton, Density response of neutral atmospheric layers to gravity
wave perturbations, J. Geophys. Res., 90, 1745-1754, 1985.

Gardner, C. S., D. G. Voelz, C. F. Sechrist, Jr., and A. C. Segal, Lidar studies of the nighttime
sodium layer over Urbana, Illinois, 1. Seasonal and nocturnal variations, J. Geophys. Res.,
91, 13659-13673, 1986.

Gardner, C. S., and D. G. Voelz, Lidar studies of the nighttime sodium layer over Urbana,
Illinois, 2. Gravity waves and tides, J. Geophys. Res., 92, 4673-4694, 1987.

Gardner, C. S., D. C. Senft, K. H. Kwon, R. E. Bills, and K. Henrikson, Lidar observations
of sodium depletion in the summertime arctic mesosphere, AGU fall meeting, San Francisco,
Calif., Dec. 7-11, 1987.

Gardner, C. S., D. C. Senft, and K. H. Kwon, Lidar observations of substantial sodium
depletion in the summertime arctic mesosphere, Nature, 332, 142-144, 1988.

Gardner, C. S., Sodium resonance fluorescence lidar applications in atmospheric science and
astronomy, Proc. IEEE, in press, 1989.

Gardner, C. S., M. S. Miller, and C. H. Liu, Rayleigh lidar observations of gravity wave
activity in upper stratosphere at Urbana, Illinois, J. Atmos. Sci., in press, 1989.




Gardner, C. S., and D. C. Senft, Sodium lidar studies of gravity wave winds and spectra in the
mesosphere, in preparation, April, 1989.

Gibson, A. J., and M. C. W. Sandford, The seasonal variations of the nighttime sodium layer,
J. Atmos. Terr. Phys., 33, 1675-1684, 1971.

Gibson, A. J., L. Thomas, and S. K. Bhattacharyya, Laser observations of the ground-state
hyperfine structure of sodium and of temperatures in the upper atmosphere, Nature, 281,
131-132, 1979.

Hines, C. O., Internal atmospheric gravity waves at ionospheric heights, Can. J. Phys., 38,
1441-1481, 1960.

Holton, J. R., The role of gravity wave induced drag and diffusion in the momentum budget of
the mesosphere, J. Atmos. Sci., 39, 791-799, 1982.

Holton, J. R., The influence of gravity wave breaking on the general circulation of the middle
atmosphere, J. Atmos. Sci., 40, 2497-2507, 1983.

Hunten, D. M., and L. Wallace, Rocket measurements of the sodium dayglow, J. Geophys.
Res., 72, 69-79, 1967.

Jegou, J. P., C. Granier, M. L. Chanin, and G. Megie, General theory of the alkali metals
present in the Earth's upper atmosphere, I, Flux model: Chemical and dynamical processes,
Ann. Geophys., 3, 163-176, 1985a.

Jegou, J. P., C. Granier, M. L. Chanin, and G. Megie, General theory of the alkali metals
present in the Earth's upper atmosphere, II, Seasonal and meridional variations, Ann.
Geophys., 3, 299-312, 1985b.

Juramy, P., M. L. Chanin, G. Megie, G. F. Toulinov, and Y. P. Doudoladov, Lidar sounding
of the mesospheric sodium layer at high latitudes, J. Atmos. Terr. Phys., 43, 209-215,
1981.

Kirchhoff, V. W. J. H., Theory of the atmospheric sodium layer: a review, Can. J. Phys., 64,
1664-1672, 1986.

Kwon, K. H., C. S. Gardner, D. C. Senft, F. L. Roesler, and J. Harlander, Daytime lidar
measurements of tidal winds in the mesospheric sodium layer at Urbana, Illinois, J.
Geophys. Res., 92, 8781-8786, 1987.

Kwon, K. H., D. C. Senft, and C. S. Gardner, Lidar observations of sporadic sodium layers at
Mauna Kea Observatory, Hawaii, J. Geophys. Res., 93, 14199-14208, 1988.

Kwon, K. H., D. C. Senft, and C. S. Gardner, Airborne sodium lidar observations of
horizontal and vertical wavenumber spectra of mesopause density and wind perturbations,
submitted to J. Geophys. Res., Feb. 1989a.

Kwon, K. H., C. S. Gardner, S. K. Avery, J. P. Avery, and C. A. Whitmord, Correlative
radar and airborne sodium lidar observations of the vertical and horizontal structure of gravity
waves and tides near the mesopause, in preparation, April, 1989b.




Kwon, K. H., and C. S. Gardner, Airborne sodium lidar measurements of gravity wave
intrinsic parameters, in preparation, April, 1989c.

Lindzen, R. S., Turbulence and stress owing to gravity wave and tidal breakdown, J. Geophys.
Res., 86, 9707-9714, 1981.

Mathews, J. D., Measurements of the diurnal tides in the 80-100 km altitude range at Arecibo, J.
Geophys. Res., 81, 4671-4677, 1976.

Manson, A. H., and C. E. Meek, Gravity wave propagation characteristics (60-120 km) as
determined by the Saskatoon MF radar (Gravnet) system: 1983-85 at 52°N, 107°W, J.
Atmos. Sci., 45, 932-946, 1988.

Meek, C. E., I. M. Reid, and A. H. Manson, Observations of mesospheric wind velocities I.
Gravity wave horizontal scales and phase velocities determined from spaced wind
observations, Radio Sci., 20, 1363-1382, 1985.

Megie, G., and J. E. Blamont, Laser sounding of atmospheric sodium: Interpretation in terms of
global atmospheric parameters, Planet. Space Sci., 25, 1093-1109, 1977.

Megie, G., M. L. Chanin, G. F. Tulinov, Y. P. Dudoladov, High latitude measurements of the
atomic sodium concentration and neutral temperature at the mesopause level by lidar
technique, Planet. Space Sci., 26, 509-511, 1978.

Neuber, R., P. von der Gathen, and U. von Zahn, Altitude and temperature of the mesopause at
69°N latitude in winter, J. Geophys. Res., 93, 11093-11101, 1988.

Nomura, A., T. Kano, Y. Iwasaka, H. Fukunishi, T. Hirasawa, and S. Kawaguchi, Lidar
observations of the mesospheric sodium layer at Syowa Station, Antarctica, Geophys. Res.
Lett., 14, 700-703, 1987.

Philbrick, C. R., J. Barnett, R. Gerndt, D. Offermann, W. R. Pendleton, Jr., P. Schlyter, J. F.
Schmidlin, and G. Witt, Temperature measurements during the CAMP program, Adv. Space
Res., 4, 153-156, 1984.

Reid, I. M., and R. A. Vincent, Measurements of the horizontal scales and phase velocities of
short period mesospheric gravity waves at Adelaide, Australia, J. Atmos. Terr. Phys., 49,
1033-1048, 1987.

Richter, E. S., and C. F. Sechrist, Jr., A cluster ion chemistry for the mesospheric sodium
layer, J. Atmos. Terr. Phys., 41, 579-586, 1979.

Rowlett, J. R., C. S. Gardner, E. S. Richter, and C. F. Sechrist, Jr., Lidar observations of
wave-like structure in the atmospheric sodium layer, Geophys. Res. Lett., 5, 683-686,
1978.

Segal, A. C., D. G. Voelz, C. S. Gardner, and C. F. Sechrist, Jr., Airborne lidar observations
of the mesospheric sodium layer, 12th International Laser Radar Conference, 243-246, G.
Megie, Ed., Aix en Provence, France, 1984.

Senft, D. C., R. L. Collins, and C. S. Gardner, First lidar observations of large sporadic
sodium layers at mid-latitudes, submitted to Geophys. Res. Lett., 1989.




Simonich, D. M., B. R. Clemesha, and V. W. J. H. Kirchhoff, The mesospheric sodium layer
at 23°S: Nocturnal and seasonal variations, J. Geophys. Res., 84, 1543-1550, 1979.

Slipher, V. M., Emissions in the spectrum of the light of the night sky, Publs. Astr. Soc. Pacif.,
41, 262, 1929.

Smith, S. A., and D. C. Fritts, Estimation of gravity wave motions, momentum fluxes and
induced mean flow accelerations in the winter mesosphere over Poker Flat, Alaska, 21st
Conference on Radar Meteorology, AMS, Edmonton, Canada, 1983.

Smith, S. A., D. C. Fritts, and T. E. Van Zandt, Comparison of mesosphere wind spectra with a
gravity wave model, Radio Sci., 20, 1331-1338, 1985.

Swider, W., Enhanced seasonal variations for chemical rates with inverse temperature
dependences: Application to seasonal abundance of mesospheric sodium, Geophys. Res.
Lett., 12, 589-591, 1985.

Thomas, L., A. J. Gibson, and S. K. Bhattacharyya, Lidar observations of a horizontal variation
in the atmospheric sodium layer, J. Atmos. Terr. Phys., 39, 1405-1409, 1977.

Thomas, L., M. C. Isherwood, and M. R. Bowman, A theoretical study of the height
distribution of sodium in the mesosphere, J. Atmos. Terr. Phys., 45, 587-594, 1983.

Thompson, L. A., and C. S. Gardner, Experiments on laser guide stars at Mauna Kea
Observatory for adaptive imaging in astronomy, Nature, 328, 229-231, 1987.

Tilgner, C., and U. von Zahn, Average properties of the sodium density distribution as observed
at 69°N latitude in winter, J. Geophys. Res., 93, 8439-8454, 1988.

Tsuda, T., T. Inoue, D. C. Fritts, T. E. Van Zandt, S. Kato, T. Sato, and S. Fukao, MST radar
observations of a saturated gravity wave spectrum, in preparation, 1988.

Vincent, R. A., and I. M. Reid, HF doppler measurements of mesospheric gravity wave
momentum fluxes, J. Atmos. Sci., 40, 1321-1333, 1983.

Vincent, R. A., Gravity-wave motions in the mesosphere, J. Atmos. Terr. Phys., 46, 119-128,
1984.

Vincent, R. A., and D. C. Fritts, A climatology of gravity wave motions in the mesopause
region at Adelaide, Australia, J. Atmos. Sci., 44, 748-760, 1987.

von Zahn, U., and R. Neuber, Thermal structure of the high latitude mesopause region in
winter, Beitr. Phys. Atmos., 60, 294-304, 1987.

von Zahn, U., P. von der Gathen, and G. Hansen, Forced release of sodium from upper
atmospheric dust particles, Geophys. Res. Lett., 14, 76-79, 1987.

von Zahn, U., and C. Tilgner, The sodium layer at 69°N latitude in wintertime, Proc. 8th Eur.
Space As. Symp. Eur. Rocket and Balloon Prog., ESA SP-270, Sunne, Sweden, 17-23,
May, 1987.




von Zahn, U., and T. L. Hansen, Sudden neutral sodium layers: A strong link to sporadic E
layers, J. Atmos. Terr. Phys., 50, 93-104, 1988.

von Zahn, U., G. Hansen, and H. Kurzawa, Observations of the sodium layer at high latitudes
in summer, Nature, 331, 594-596, 1988.

Wang, S. T., D. Tetenbaum, B. B. Balsley, R. L. Obert, S. K. Avery, and J. P. Avery, A
meteor echo detection and collection system for use on VHF radars, Radio Sci., 23, 46-54,
1988.


VITA 

Kang Hyon Kwon was born in Seoul, South Korea on September 25, 1960. He
attended the University of Illinois at Urbana-Champaign, from which he received a Bachelor of
Science degree in Electrical Engineering in 1984 and a Master of Science degree in Electrical
Engineering in 1986.

He  was  a  research  assistant  at  the  University  from  1984  to  1987,  and  since  1987  he  has 
been  a  United  States  Air  Force  Laboratory  Graduate  Fellow.  Mr.  Kwon  is  a  member  of  Phi 
Kappa  Phi,  Golden  Key  National  Honor  Society,  and  the  Institute  of  Electrical  and  Electronics 
Engineers.  Mr.  Kwon  has  coauthored  the  following  papers. 

1.  Kwon,  K.  H.,  C.  S.  Gardner,  D.  C.  Senft,  F.  L.  Roesler,  and  J.  Harlander,  Daytime  lidar 
measurements  of  tidal  winds  in  the  mesospheric  sodium  layer  at  Urbana,  Illinois,  J.  Geophys. 
Res.,  92,  8781-8786,  1987. 

2. Gardner, C. S., D. C. Senft, and K. H. Kwon, Lidar observations of substantial sodium
depletion in the summertime arctic mesosphere, Nature, 332, 142-144, 1988.

3. Beatty, T. J., R. E. Bills, K. H. Kwon, and C. S. Gardner, CEDAR lidar observations of
sporadic Na layers at Urbana, Illinois, Geophys. Res. Lett., 15, 1137-1140, 1988.

4. Kwon, K. H., D. C. Senft, and C. S. Gardner, Lidar observations of sporadic sodium
layers at Mauna Kea Observatory, Hawaii, J. Geophys. Res., 93, 14199-14208, 1988.

5.  Kwon,  K.  H.,  D.  C.  Senft,  and  C.  S.  Gardner,  Airborne  sodium  lidar  observations  of 
horizontal  and  vertical  wavenumber  spectra  of  mesopause  density  and  wind  perturbations, 
submitted  to  J.  Geophys.  Res.,  Feb.  1989. 

6. Kwon, K. H., C. S. Gardner, S. K. Avery, J. P. Avery, and C. A. Whitmord,
Correlative radar and airborne sodium lidar observations of the vertical and horizontal
structure of gravity waves and tides near the mesopause, in preparation, April, 1989.

7.  Kwon,  K.  H.,  and  C.  S.  Gardner,  Airborne  sodium  lidar  measurements  of  gravity  wave 
intrinsic  parameters,  in  preparation,  April,  1989. 




LATTICE STRUCTURES IN THE IMAGE ALGEBRA
AND APPLICATIONS TO IMAGE PROCESSING


By 


JENNIFER  L.  DAVIDSON 


A  DISSERTATION  PRESENTED  TO  THE  GRADUATE  SCHOOL 
OF  THE  UNIVERSITY  OF  FLORIDA  IN  PARTIAL  FULFILLMENT 
OF  THE  REQUIREMENTS  FOR  THE  DEGREE  OF 
DOCTOR  OF  PHILOSOPHY 

UNIVERSITY  OF  FLORIDA 


1989 


August  11,  1989 


Jennifer  L.  Davidson 
201  Walker  Hall 
University  of  Florida 
Department  of  Mathematics 
Gainesville,  FL  32611 


Ms.  Judy  Conover 
Universal  Energy  Systems,  Inc. 
4401  Dayton-Xenia  Road 
Dayton,  OH  45432 


Dear  Ms.  Conover: 

Enclosed you will find a copy of my dissertation as requested. I graduate on August
12, 1989, and will finish my time as a graduate student at the University of Florida on that
date.

I  would  like  to  extend  my  appreciation  and  gratitude  to  UES  for  administering  my 
grant  and  fellowships  in  a  very  efficient  and  pleasing  manner.  All  UES  employees  were 
very  courteous  and  prompt  in  responding  to  my  queries. 


Sincerely, 


Jennifer  L.  Davidson 


THOMAS E. WALSH
DIRECTOR OF RESEARCH


ACKNOWLEDGEMENTS 


I would like to thank my advisor, Dr. Gerhard X. Ritter, for teaching me so much
about doing research, for allowing me the opportunity to perform independent research on
his contract, and for giving me the chance to work in an exciting area of applied mathematics.
Without his constant encouragement, I would not have seen the beauty of mathematics,
nor would I have succeeded in mathematics the way I did. I thank Dr. David C. Wilson for
providing the opportunity of working with him during a summer, and for all the help he has
given since then. To Dr. Joseph Wilson I extend my deepest gratitude for helping me with
questions in computer science. I am also indebted to all the members of my committee for
the help and encouragement that they have given me. To my parents goes a debt that I can
only repay in love: they provided me the opportunity to attend a small, private and very good
school for my undergraduate education, which was a critical turning point in my life. I
would also like to thank Dr. Sam Lambert and Mr. Neal Urquhart of the Air Force Armament
Laboratory and Dr. Jasper Lupo of DARPA for partial support of this research under
Contract F08635-84-C-0295.

Finally,  I  acknowledge  my  debt  to  the  American  taxpayers  who  provided  the  financial 
support  for  the  U.S.  Fellowship  programs,  loans,  research  assistantships  and  state  teaching 
assistantships  which  supported  me  through  most  of  my  time  in  graduate  school.  I  hope  to 
contribute  to  society  so  that  this  support  will  be  justified. 


Copyright 1989

by

Jennifer L. Davidson


TABLE OF CONTENTS


ACKNOWLEDGEMENTS

LIST OF SYMBOLS

ABSTRACT

INTRODUCTION

    Background of Image Algebra
    Parallel Image Processing
    Summary of Results

PART I. LATTICE STRUCTURES IN IMAGE ALGEBRA AND OPERATIONS RESEARCH

CHAPTERS

1. THE TWO ALGEBRAS

    1.1. Image Algebra: Basic Definitions and Notation
    1.2. Minimax Algebra

2. THE ISOMORPHISM

PART II. MINIMAX APPLICATIONS TO IMAGE ALGEBRA AND IMAGE PROCESSING

3. MAPPING OF MINIMAX ALGEBRA PROPERTIES TO IMAGE ALGEBRA PROPERTIES

    3.1. Basic Definitions and Properties
    3.2. Systems of Equations
    3.3. Rank of Templates
    3.4. The Eigenproblem in the Image Algebra

4. GENERALIZATION OF MATHEMATICAL MORPHOLOGY

5. TRANSFORM DECOMPOSITION

    5.1. New Matrix Decomposition Results
    5.2. Decomposition of Templates
    5.3. Applications to Rectangular Templates

6. THE DIVISION ALGORITHM

    6.1. A Division Algorithm in a Non-Euclidean Domain
    6.2. An Image Algebra Division Algorithm

7. TWO EXAMPLES

    7.1. An Operations Research Problem Stated in Image Algebra Notation
    7.2. An Image Complexity Measure

CONCLUSIONS AND SUGGESTIONS FOR FURTHER RESEARCH

REFERENCES

BIOGRAPHICAL SKETCH


LIST OF SYMBOLS


Symbol                                   Explanation

$\mathbf{Z}$                             the set of integers
$\mathbf{R}$                             the set of real numbers
$\mathbf{R}^{+}$                         the set of non-negative real numbers
$\mathbf{F}$                             an arbitrary value set
$\phi$                                   the identity element of F under its group operation
$\mathbf{F}^{n}$                         the Cartesian product of F
$2^{S}$                                  the power set of S (set of all subsets of S)
$\emptyset$                              the empty set
$\in,\ \notin,\ \subset$                 is an element of, is not an element of, is a subset of
$\cup,\ \cap$                            set union, set intersection
$f : X \to Y$                            f is a function from X to Y
$f^{-1}$                                 the inverse of function f
$\mathbf{F}_{-\infty}$                   the set $\mathbf{F} \cup \{-\infty\}$
$\mathbf{F}_{\pm\infty}$                 the set $\mathbf{F} \cup \{-\infty, +\infty\}$
$\mathbf{F}_{+\infty}$                   the set $\mathbf{F} \cup \{+\infty\}$
$\vee,\ \wedge$                          maximum, minimum
$X \setminus Y$                          the set difference of X and Y
W, X, Y                                  coordinate sets
w, x, y                                  pixel locations
$\mathbf{F}^{\mathbf{X}}$                the set of all functions from X to F
a, b, c                                  images
$\mathbf{1} \in \mathbf{F}^{\mathbf{X}}$ a constant image on X with value 1 at each coordinate
$\mathbf{0} \in \mathbf{F}^{\mathbf{X}}$ a constant image on X with value 0 at each coordinate
$\mathbf{1} \in (\mathbf{F}^{\mathbf{X}})^{\mathbf{X}}$
                                         a one-point template from X to X with
                                         $\mathbf{1}_{\mathbf{y}}(\mathbf{x}) = \phi$ if x = y, and $-\infty$ otherwise
$\Phi$                                   the null template with $\Phi_{\mathbf{y}}(\mathbf{x}) = -\infty$ for all y in Y, x in X
$\chi_{S}(\mathbf{a})$                   the characteristic function over set S of image a
$f(\mathbf{a})$                          the function f induced pointwise over image a
$S(\mathbf{t}_{\mathbf{y}})$             the support of template t in $(\mathbf{R}^{\mathbf{X}})^{\mathbf{Y}}$
$S_{-\infty}(\mathbf{t}_{\mathbf{y}})$   the infinite support of template t in $(\mathbf{F}^{\mathbf{X}})^{\mathbf{Y}}$
$S_{+\infty}(\mathbf{t}_{\mathbf{y}})$   the positive infinite support of template t in $(\mathbf{F}^{\mathbf{X}})^{\mathbf{Y}}$
$\mathbf{t}_{\mathbf{y}}$                the image function of template t at location y
r, s, t                                  templates
$(\mathbf{F}^{\mathbf{X}})^{\mathbf{Y}}$ the set of all F valued templates from Y to X
$\oplus$                                 generalized convolution
[illegible]                              multiplicative maximum, multiplicative minimum
[illegible]                              additive maximum, additive minimum
$|S|$                                    the cardinality function, counting the number of elements in set S
$\Sigma \mathbf{a}$                      the sum of all pixel values of the image a
$\vee \mathbf{a}$                        the maximum pixel value in image a
[illegible]                              the additive dual image of the image a
[illegible]                              the multiplicative dual image of the image a
[illegible]                              the additive dual template of the template t
[illegible]                              the multiplicative dual template of the template t
$\mathbf{t} \in M_{mn}$                  an m x n matrix
[illegible]                              the i-th column of the matrix t
[illegible]                              the i-th row of the matrix t
$\mathbf{t}'$                            the transpose of the matrix t, or the transpose of the template t
$\mathbf{F}_{\pm\infty}$                 a blog with group F
[illegible]                              the infinite support of matrix t at row i
[illegible]                              the infinite positive support of matrix t in $M_{mn}$ at row i
[illegible]                              the extended characteristic function
$\Leftrightarrow$                        if and only if


Abstract  of  Dissertation  Presented  to  the  Graduate  School 
of  the  University  of  Florida  in  Partial  Fulfillment  of  the 
Requirements  for  the  Degree  of  Doctor  of  Philosophy 


LATTICE  STRUCTURES  IN  THE  IMAGE  ALGEBRA 
AND  APPLICATIONS  TO  IMAGE  PROCESSING 

By 

Jennifer  L.  Davidson 
August  1989 


Chairman:  Dr.  Gerhard  X.  Ritter 
Major  Department:  Mathematics 

The research for this dissertation is concerned with the investigation of an algebraic
structure, known as image algebra, which is used for expressing algorithms in image processing.
The major result of this research is the establishment of a rigorous and coherent
mathematical foundation of the subalgebra of the image algebra involving non-linear image
transformations. In particular, a classification in the image algebra of a set of non-linear
image transformations called lattice transforms is presented, using minimax matrix algebra
as a tool. Several applications to image processing problems are discussed. Specifically, in
addition to describing several non-linear transform decomposition techniques, the subalgebra
is used as a model and a tool for the development of methods to compute lattice transforms
locally.


The basic operands and operations of the image algebra and minimax algebra are
defined, as well as the relationships between the two algebras. Properties of the minimax
algebra including the lattice eigenvalue problem are mapped to the image algebra.
Mathematical morphology is shown to be embedded in the image algebra as a special subclass
of lattice transforms. Networks of processors are modeled as graphs, and images are
represented as functions defined on the nodes of the graph. It is shown that every lattice
image-to-image transform can be weakly factored into a product of lattice transformations
each of which is implementable on the network if and only if the graph corresponding to
the network is strongly connected. Necessary and sufficient conditions are given to decompose
a rectangular template into two strip templates. A division algorithm is given which is
a generalization of a boolean skeletonizing technique. The transportation problem from
linear programming is expressed in the image algebra. A method to produce an image complexity
measure is discussed. Most results are given both in image algebra and matrix algebra
notation.




INTRODUCTION 


Background of the Image Algebra

The results presented in this dissertation reflect the ongoing investigation of the structure
of the Air Force image algebra, an algebraic structure specifically designed for use in
image processing. The idea of establishing a unifying theory for concepts and operations
encountered in image and signal processing has been pursued for a number of years now. It
was the 1950's work of von Neumann that inspired Unger to propose a "cellular array"
machine on which to implement, in parallel, many algorithms for image processing and
analysis [1,2]. Among the machines embodying the original automaton envisioned by von
Neumann are NASA's massively parallel processor or MPP [3], and the CLIP series of computers
developed by M.J.B. Duff and his colleagues [4,5]. A more general class of cellular
array computers are pyramids [6] and the Connection Machine, by Thinking Machines Corporation [7].

Many of the operations that cellular array machines perform can be expressed by a set
of primitives, or simple elementary operations. One opinion of researchers who design parallel
image processing architectures is that a wide class of image transformations can be
represented by a small set of basic operations that induce these architectures. G. Matheron
and J. Serra developed a set of two primitives that formed the basis for the initial development
of a theoretical formalism capable of expressing a large number of algorithms for image
processing and analysis. Special purpose parallel architectures were then designed to implement
these ideas. Several systems in use today are Matheron and Serra's Texture
Analyzer [8], the Cytocomputer at the Environmental Research Institute of Michigan (ERIM)
[9,10], and Martin Marietta's GAPP [11].

The basic mathematical formalisms associated with the above cellular architectures are
the concepts of pixel neighborhood arithmetic and mathematical morphology. Mathematical
morphology is a mathematical structure used in image processing to express image processing
transformations by the use of structuring elements, which are related to the shape of the
objects to be analyzed. The origins of mathematical morphology lie in work done by H. Minkowski
and H. Hadwiger on geometric measure theory and integral geometry [12,13,14]. It
was Matheron and Serra who used a few of Minkowski's operations as a basis for describing
morphological image transformations [15,16], and then implemented their ideas by building
the Texture Analyzer System. Some recent research papers on morphological image processing
are Crimmins and Brown [17], Haralick, Lee and Shapiro [18], Haralick, Sternberg and
Zhuang [19], and Maragos and Schafer [20,21,23].

It was Serra and Sternberg who first unified morphological concepts into an algebraic
theory specifically focusing on image processing and image analysis. The first to use the term
"Image Algebra" was, in fact, Sternberg [23,24]. Recently, a new theory encompassing a
large class of linear and nonlinear systems was put forth by P. Maragos [25]. However,
despite these profound accomplishments, morphological methods have some well known limitations.
They cannot, with the exception of a few simple cases, express some fairly common
image processing techniques such as Fourier-like transformations, feature extraction based on
convolution, histogram equalization transforms, chain-coding, and image rotation. At
Perkin-Elmer, P. Miller demonstrated that a straightforward and uncomplicated target
detection algorithm, furnished by the U.S. Government, could not be expressed using a morphologically
based image algebra [26].




The morphological image algebra is built on the Minkowski addition and subtraction of
sets [14], and it is this set-theoretic formulation of its basic operations which prevents
mathematical morphology from being used as a basis for a general purpose, algebraically based
language for digital image processing. These operations ignore the linear domain, transformations
between different domains (spaces of different dimensionalities) and transformations
between different value sets, e.g. sets consisting of real, complex, or vector valued numbers.
The image algebra which was developed at the University of Florida includes these concepts
and also incorporates and extends the morphological operations.


Parallel Image Processing

The processing of images on digital computers requires enormous amounts of time and
memory. With the advent of Very Large Scale Integrated (VLSI) circuits, the cellular array
of von Neumann became a reality. There are many types of parallel architectures in
existence [27], and various ways of categorizing them have been attempted [28]. The general
scheme of one popular type of parallel processor is to have many processing elements, or
small processors with limited memory, interconnected by communication links. Each processing
element can communicate directly with a single controller as well as with a very
small number of its neighbors, usually 1 to 8. When the controller gives a signal, all processing
elements simultaneously perform some arithmetic and/or logic operation, each using the values
of the neighbors to which it is connected. This type of parallel processor is called a neighborhood
array processor, as communication links connect the center processor to a small subset
of its spatially nearest neighbors. Two typical neighborhood configurations for local interconnection
links are shown in Figure 1. The box with the x represents the center processor, and
the four (or eight) boxes immediately adjacent to x represent the four (or eight) processors
with which x can send and receive information via the communication links. The set of pixel
locations relative to the center pixel location, x, forms the local neighborhood of x.


Figure 1. Two Neighborhood Configurations.
(a) The von Neumann Configuration; (b) The Moore Configuration.


Some of the parallel processors that have been built to implement this type of connection
scheme are the MPP, the Distributed Array Processor (ICL DAP) [27,29], the Geometric
Arithmetic Parallel Processor (GAPP), and the CLIP4. There are other types of parallel
architectures, such as pipeline computers [30] and systolic arrays [31], which differ in construction
and implementation of the neighborhood functions. However, the key feature in
most of these architectures is that they have a large number of processing elements, each of
which communicates directly with only a small subset of the others.

If every value of a transformed image at location x involves arithmetically or logically
manipulating information only from pixel locations in the local neighborhood of x, then the
transform is called a local transform. Assuming that a transform can be described in a local
manner, the amount of time to perform a local operation globally on neighborhood array processors
is the amount of time it takes one processor to perform it, often a single clock cycle.
Certain image transforms which were previously too computationally intensive can now be
implemented on parallel and distributed processors.




In general, image transforms are not local, that is, the calculation of a transformed
value may depend on input values which are spatially very distant from the processing element.
In order to use parallel processors, the transform must first be decomposed into a product
of local transforms. The existence of local decompositions is of theoretical and practical
interest, and as such provides the main thrust behind the research in this dissertation.
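
As a small illustration of what a local decomposition means (this example is not taken from the dissertation), the sketch below compares a non-local transform, the maximum over a 5 x 5 window, with two successive applications of a local transform that only reads the 3 x 3 Moore neighborhood. The function names, the test image, and the choice to clip neighborhoods at the image border are all assumptions made for the illustration.

```python
# Offsets of the two neighborhoods in Figure 1 (center included).
VON_NEUMANN = [(0, 0), (-1, 0), (1, 0), (0, -1), (0, 1)]
MOORE = [(di, dj) for di in (-1, 0, 1) for dj in (-1, 0, 1)]


def neighborhood_max(image, offsets):
    """Each pixel takes the maximum over the given offsets (clipped at the border)."""
    rows, cols = len(image), len(image[0])
    out = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            out[i][j] = max(
                image[i + di][j + dj]
                for di, dj in offsets
                if 0 <= i + di < rows and 0 <= j + dj < cols
            )
    return out


def window_max(image, radius):
    """Reference transform: maximum over a (2*radius+1) x (2*radius+1) window."""
    offsets = [
        (di, dj)
        for di in range(-radius, radius + 1)
        for dj in range(-radius, radius + 1)
    ]
    return neighborhood_max(image, offsets)


image = [
    [3, 1, 4, 1, 5, 9],
    [2, 6, 5, 3, 5, 8],
    [9, 7, 9, 3, 2, 3],
    [8, 4, 6, 2, 6, 4],
    [3, 3, 8, 3, 2, 7],
    [9, 5, 0, 2, 8, 8],
]

# Two local (3 x 3) steps reproduce the non-local 5 x 5 window maximum.
two_local_steps = neighborhood_max(neighborhood_max(image, MOORE), MOORE)
assert two_local_steps == window_max(image, 2)
```

On a neighborhood array processor each call to `neighborhood_max` with the Moore offsets would correspond to one parallel step, so the 5 x 5 transform costs two steps rather than one pass over 25 neighbors per pixel.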

While  such  parallel  architectures  are  attractive  for  use  in  image  processing,  much 
research  still  needs  to  be  done  and  implementation  techniques  developed  in  order  to  use  the 
architectures  most  efficiently. 


Summary of Results

The results in this dissertation stem from an investigation into the image algebra operations
of 0, ®, and V. A brief description of the image algebra and its use as a model for
image processing is presented. A full discussion of the entire image algebra is presented by
Ritter et al. [32]. The results given here focus mainly on two non-linear image transform
operations whose underlying values have the structure of a lattice. In particular, it is shown
that a previously determined, well-defined mathematical structure called the minimax algebra
can be used to place the study of a wide class of non-linear, lattice-based image
transforms on a solid mathematical foundation. We also discuss the mapping of these
transforms to certain types of parallel architectures.

It has been well established that the image algebra is capable of expressing all linear
transformations [33]. The embedding of linear algebra into the image algebra makes this
possible. The major contributions of this thesis are the development of two isomorphisms
between the minimax algebra and image algebra which refine the lattice subalgebra of the
image algebra, and the development of new and useful mathematical tools which are of practical
use in the area of image processing.

The dissertation is divided into two parts. Part I gives an introduction to the two algebras,
the image algebra and minimax algebra. Part II is devoted to presenting new matrix
theoretical results which have applications to solving image processing problems.
Specifically, Chapter 1 is of an introductory nature, presenting a historical background of the
image algebra and a brief discussion of where lattice structures appear to be useful in
mathematically characterizing problems in image processing and operations research.
Chapter 1 also presents a brief introduction to the image algebra as well as to the minimax
algebra. We mention that although vector lattices are contained within the image algebra,
they have been investigated [34] and will not be discussed here. The isomorphisms which
embed the minimax algebra into the image algebra are given in Chapter 2, and the mapping of
the minimax algebra properties into image algebra notation is presented in Chapter 3. In
Chapter 4 we give the relationship of mathematical morphology to image algebra. In
Chapter 5 we present new matrix theoretical results, which have applications to template
decomposition. An algorithm similar to the division algorithm for integers is given both in
minimax algebra and image algebra notation in Chapter 6. In Chapter 7 we present the formulation
of an operations research problem in image algebra notation, and give an image complexity
algorithm. We then present the conclusions and give suggestions for future research after
Chapter 7.


PART I

LATTICE  STRUCTURES  IN  IMAGE  ALGEBRA  AND  OPERATIONS  RESEARCH 

The algebraic structures of early image processing languages such as mathematical
morphology had no obvious connection with a lattice structure. Those algebras were
developed to express binary image manipulation. As the extension to gray valued images
developed, the notions of performing maximums and minimums over a set of numbers
emerged. Formal links to lattice structures were not developed until very recently [34],
including this dissertation. We present a little background in this area, showing how the lattice
properties were inherent in the structures being developed.

The algebraic operations developed by Serra and Sternberg are equivalent and are based on
the operations of Minkowski addition and Minkowski subtraction of sets in $\mathbf{R}^n$. Given
$A \subset \mathbf{R}^n$ and $B \subset \mathbf{R}^n$, Minkowski addition is defined by

\[ A + B = \{\, a + b : a \in A,\ b \in B \,\} \]

and Minkowski subtraction is defined by

\[ A / B = \overline{\overline{A} + B}, \]

where the bar denotes set complementation. Mathematical morphology was initially
developed for boolean image processing, that is, for processing images that have only two
values, say 0 and 1. It was eventually extended to include gray-level image processing, that
is, images that take on more than two values. The value set underlying the gray value
mathematical morphology structure was the set $\mathbf{R}_{-\infty} = \mathbf{R} \cup \{-\infty\}$, the real numbers with
$-\infty$ adjoined. Sternberg's functional notation is most often used to express the two morphological
operations, as it is simply stated and easy to implement in computer code. The gray
value operations of dilation and erosion, corresponding to Minkowski addition and
subtraction, respectively, are

\[ D(x,y) = \max_{i,j} \left[ A(x - i,\, y - j) + B(i,j) \right] \]
\[ E(x,y) = \min_{i,j} \left[ A(x - i,\, y - j) - B(-i,-j) \right], \]

where A and B are real valued functions on $\mathbf{R}^2$.
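
The two gray value operations above can be sketched in a few lines of code. The following is a minimal illustration, not the author's implementation: images and structuring functions are stored as dictionaries keyed by integer pixel locations, and terms whose shifted location falls outside the domain of A are simply skipped, which is an assumption about boundary handling that the text does not specify. The names `dilate` and `erode` are hypothetical.

```python
NEG_INF = float("-inf")
POS_INF = float("inf")


def dilate(A, B):
    """D(x, y) = max over (i, j) in dom(B) of A(x - i, y - j) + B(i, j)."""
    return {
        (x, y): max(
            (A[(x - i, y - j)] + b
             for (i, j), b in B.items()
             if (x - i, y - j) in A),
            default=NEG_INF,
        )
        for (x, y) in A
    }


def erode(A, B):
    """E(x, y) = min over (i, j) of A(x - i, y - j) - B(-i, -j).

    Substituting (p, q) = (-i, -j) lets the loop run over dom(B) directly.
    """
    return {
        (x, y): min(
            (A[(x + p, y + q)] - b
             for (p, q), b in B.items()
             if (x + p, y + q) in A),
            default=POS_INF,
        )
        for (x, y) in A
    }


# A 3 x 3 image with a single bright pixel, and a flat cross-shaped B.
A = {(x, y): 0.0 for x in range(3) for y in range(3)}
A[(1, 1)] = 5.0
B = {(0, 0): 0.0, (1, 0): 0.0, (-1, 0): 0.0, (0, 1): 0.0, (0, -1): 0.0}

D = dilate(A, B)  # the bright pixel spreads to its four neighbors
E = erode(A, B)   # the isolated bright pixel is removed
```

With a flat (all-zero) B the two operations reduce to a local maximum and a local minimum over the support of B, which is the link to lattice operations that the following paragraphs develop.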

As will be shown, mathematical morphology, which uses the lattice $\mathbf{R}_{-\infty}$, is actually a
very special subalgebra of the full image algebra. It is well known that
$\mathbf{R}_{\pm\infty} = \mathbf{R} \cup \{+\infty, -\infty\}$ is a complete lattice [36]. The lattice structure provides the basis
for categorizing certain classes of image processing problems, which is the main subject of
this dissertation.

Operations research has long been known for its class of problems in optimization. A
certain type of non-linear operations research problems has been the focus of Cuninghame-Green
during his research [37,38]. The types of optimization problems that were considered
by this author used arithmetic operations different from the usual multiplication and summation.
Some machine scheduling and shortest path problems, for example, could be best
characterized by a non-linear system utilizing additions and maximums. A monograph entitled
Minimax Algebra [39] describes a matrix calculus which uses a special case of what is
called a generalized matrix product [40], where matrices and vectors assume values from a
lattice. A few more conditions such as a group operation on the lattice, and the self-duality
of the resulting structure, allow Cuninghame-Green to develop a solid mathematical foundation
in which to pose a wide variety of operations research questions. It is an interesting and
natural link between matrices with values in a lattice and templates in the image algebra
which provides the foundation of this dissertation.
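
To make the idea of a matrix product built from additions and maximums concrete, here is a minimal sketch over the value set R with -infinity adjoined. It assumes the usual max-plus convention, in which the (i, j) entry of the product is the maximum over k of A[i][k] + B[k][j]; the function name `maxplus_matmul` and the small scheduling-style example are illustrative assumptions, not material from the monograph.

```python
NEG_INF = float("-inf")


def maxplus_matmul(A, B):
    """Generalized matrix product with (max, +) in place of (+, *)."""
    inner = len(B)
    assert all(len(row) == inner for row in A), "inner dimensions must agree"
    return [
        [max(A[i][k] + B[k][j] for k in range(inner)) for j in range(len(B[0]))]
        for i in range(len(A))
    ]


# A[i][j] is read here as the delay before activity i may start after
# activity j has started; -inf means "no dependency".
A = [
    [2.0, 5.0],
    [NEG_INF, 3.0],
]
x = [[0.0], [1.0]]  # column vector of current start times

next_x = maxplus_matmul(A, x)                            # one step of the recurrence
two_steps = maxplus_matmul(A, next_x)                    # A applied twice
via_square = maxplus_matmul(maxplus_matmul(A, A), x)     # (A product A) applied once
```

Here `next_x` is `[[6.0], [4.0]]`, and `two_steps` equals `via_square`, reflecting the associativity of the product that makes a matrix calculus possible in this setting.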




CHAPTER  1 
THE  TWO  ALGEBRAS 

1.1. Image Algebra: Basic Definitions and Notation

This section provides the basic definitions and notation that will be used for the image
algebra throughout the dissertation. We will define only those image algebra concepts necessary
to describe ideas in this document. For a full discourse on all image algebra operands
and operations, we refer the reader to a recent publication [31].

The image algebra is a heterogeneous algebra, in the sense of Birkhoff [40], and is capable
of describing image manipulations involving not only single valued images, but multivalued
images. In fact, it has been formally proven that the set of operations is sufficient
for expressing any image-to-image transformation defined in terms of a finite algorithmic procedure,
and also that the set of operations is sufficient for expressing any image-to-image
transformation for an image which has a finite number of gray values [41,42]. We limit our
discussion to single valued images in this document, and refer the reader to other publications
on multi-valued images [31].

We  will  present  the  six  basic  operands,  some  of  the  finitary  operations  defined  between 
the  operands,  and  also  give  a  few  examples. 

1.1.1. The Operands of the Image Algebra

The six basic operands are coordinate sets, elements of coordinate sets, value sets, elements
of value sets, images, and generalized templates. They are defined as follows.




1. A coordinate set X is a subset of $\mathbf{R}^k$ for some k. Two familiar coordinate sets, the
rectangular and toroidal coordinate sets, are shown in Figure 2.


Figure 2. Two Coordinate Sets.
(a) Toroidal Lattice $\mathbf{X} \subset \mathbf{R}^3$; (b) A Finite Rectangular Array in $\mathbf{R}^2$.

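2. A value set F is a semi-group. Some value sets we are interested in are the real
numbers, the rational numbers, integers, positive reals, positive rationals, and positive
integers. These are denoted by $\mathbf{R}$, $\mathbf{Q}$, $\mathbf{Z}$, $\mathbf{R}^{+}$, $\mathbf{Q}^{+}$, and $\mathbf{Z}^{+}$, respectively. We will also
be strongly interested in some of the extended number systems. If
$\mathbf{F} \in \{\mathbf{R}, \mathbf{Q}, \mathbf{Z}, \mathbf{R}^{+}, \mathbf{Q}^{+}\}$, then $\mathbf{F}_{-\infty}$ denotes $\mathbf{F} \cup \{-\infty\}$, $\mathbf{F}_{+\infty}$ denotes $\mathbf{F} \cup \{+\infty\}$, and
$\mathbf{F}_{\pm\infty}$ denotes $\mathbf{F} \cup \{-\infty, +\infty\}$. We denote an arbitrary value set by F.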

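3. An F valued image a on a coordinate set X is an element of $\mathbf{F}^{\mathbf{X}}$. Thus, an image
$\mathbf{a} \in \mathbf{F}^{\mathbf{X}}$ is of form

\[ \mathbf{a} = \{\, (\mathbf{x}, \mathbf{a}(\mathbf{x})) : \mathbf{x} \in \mathbf{X},\ \mathbf{a}(\mathbf{x}) \in \mathbf{F} \,\}. \]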




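4. Let X and Y be coordinate sets. An F-valued template t from Y to X is an element
of $(\mathbf{F}^{\mathbf{X}})^{\mathbf{Y}}$. For each $\mathbf{y} \in \mathbf{Y}$, $\mathbf{t}(\mathbf{y})$ is an image on X. Denoting $\mathbf{t}(\mathbf{y})$ by $\mathbf{t}_{\mathbf{y}}$, we have

\[ \mathbf{t}_{\mathbf{y}} = \{\, (\mathbf{x}, \mathbf{t}_{\mathbf{y}}(\mathbf{x})) : \mathbf{x} \in \mathbf{X},\ \mathbf{t}_{\mathbf{y}}(\mathbf{x}) \in \mathbf{F} \,\} \quad \text{for all } \mathbf{y} \in \mathbf{Y}. \]

We give a pictorial representation of a generalized template $\mathbf{t} \in (\mathbf{F}^{\mathbf{X}})^{\mathbf{Y}}$ in Figure 3.
Generalized templates are discussed in detail in the section below on generalized templates.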


The set $X$ is called the set of image coordinates of $\mathbf{a} \in \mathbb{F}^X$, and the range of the function
$\mathbf{a}$ is called the image values of $\mathbf{a}$. Thus, the image values are a subset of $\mathbb{F}$. The pair
$(\mathbf{x}, \mathbf{a}(\mathbf{x}))$ is called a picture element, or pixel, and $\mathbf{x}$ is the pixel location of the pixel value or
gray value $\mathbf{a}(\mathbf{x})$. We shall use bold lower case letters, $\mathbf{x}$, to represent a vector in $\mathbb{R}^n$, and
lower case letters (not bold) for the components of the vector. Thus $\mathbf{x} = (x_1, \ldots, x_n) \in \mathbb{R}^n$,
where $x_i \in \mathbb{R}$ for all $i$. The set of all $\mathbb{F}$ valued images on $X$ is denoted by $\mathbb{F}^X$, and the set of
all $\mathbb{F}$ valued templates from $Y$ to $X$ is denoted by $(\mathbb{F}^X)^Y$.

As  we  will  not  be  using  any  of  the  operations  concerning  coordinate  sets  or  value  sets, 
we refer the reader to other publications discussing this topic [32].




1.1.2. Operations on Images

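The basic operations on and between $\mathbb{F}$ valued images are the ones induced by the algebraic
structure of the value set $\mathbb{F}$. The remaining operations can be defined in terms of these
basic ones. In particular, if $\mathbb{F} = \mathbb{R}$, then the basic operations for $\mathbf{a}, \mathbf{b} \in \mathbb{R}^X$ are

$$\begin{aligned}
\mathbf{a} + \mathbf{b} &= \{(\mathbf{x}, \mathbf{c}(\mathbf{x})) : \mathbf{c}(\mathbf{x}) = \mathbf{a}(\mathbf{x}) + \mathbf{b}(\mathbf{x}),\ \mathbf{x} \in X\} \\
\mathbf{a} * \mathbf{b} &= \{(\mathbf{x}, \mathbf{c}(\mathbf{x})) : \mathbf{c}(\mathbf{x}) = \mathbf{a}(\mathbf{x}) \cdot \mathbf{b}(\mathbf{x}),\ \mathbf{x} \in X\} \\
\mathbf{a} \vee \mathbf{b} &= \{(\mathbf{x}, \mathbf{c}(\mathbf{x})) : \mathbf{c}(\mathbf{x}) = \mathbf{a}(\mathbf{x}) \vee \mathbf{b}(\mathbf{x}),\ \mathbf{x} \in X\}.
\end{aligned}$$

If $X$ is finite, then we define the dot product of two images $\mathbf{a}, \mathbf{b} \in \mathbb{R}^X$ by

$$\mathbf{a} \bullet \mathbf{b} = \sum_{\mathbf{x} \in X} \mathbf{a}(\mathbf{x}) \cdot \mathbf{b}(\mathbf{x}).$$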
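The pointwise operations and the dot product above can be sketched in a few lines of Python. This is only an illustration: representing an image as a dictionary from pixel location to gray value, and the helper names `pointwise` and `dot`, are assumptions made here and are not part of the image algebra itself.

```python
# Illustrative sketch: an image on a finite coordinate set X is modelled
# as a dict mapping each pixel location x to its gray value a(x).

def pointwise(op, a, b):
    """Apply a binary operation of the value set pixel by pixel."""
    assert a.keys() == b.keys(), "both images must share the coordinate set X"
    return {x: op(a[x], b[x]) for x in a}

def dot(a, b):
    """Dot product of two real valued images on a finite coordinate set."""
    return sum(a[x] * b[x] for x in a)

X = [(0, 0), (0, 1), (1, 0), (1, 1)]
a = dict(zip(X, [1.0, 2.0, 3.0, 4.0]))
b = dict(zip(X, [4.0, 3.0, 2.0, 1.0]))

a_plus_b = pointwise(lambda u, v: u + v, a, b)   # a + b
a_times_b = pointwise(lambda u, v: u * v, a, b)  # a * b
a_max_b = pointwise(max, a, b)                   # a v b
a_dot_b = dot(a, b)                              # scalar
```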

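We say an image $\mathbf{a} \in \mathbb{F}^X$ is a constant image if its gray value at every pixel location is
the same. Thus, a constant image $\mathbf{a} \in \mathbb{F}^X$ has the form

$$\mathbf{a}(\mathbf{x}) = k \in \mathbb{F}, \quad \text{for all } \mathbf{x} \in X.$$

In this case we write $k$ for the image $\mathbf{a}$. There are two constant images of importance in the
image algebra. One is the zero image, defined by $\mathbf{0} = \{(\mathbf{x}, 0) : \mathbf{x} \in X\}$, and the other is the unit image,
defined by $\mathbf{1} = \{(\mathbf{x}, 1) : \mathbf{x} \in X\}$. These images have the following properties.

$$\begin{aligned}
\mathbf{a} + \mathbf{0} &= \mathbf{0} + \mathbf{a} = \mathbf{a} \\
\mathbf{a} * \mathbf{1} &= \mathbf{1} * \mathbf{a} = \mathbf{a}
\end{aligned}$$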

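Suppose $f: \mathbb{F} \to \mathbb{F}$ is given. Then $f$ induces a function from $\mathbb{F}^X$ to $\mathbb{F}^X$, also called $f$,
where

$$f(\mathbf{a}) = \{(\mathbf{x}, \mathbf{b}(\mathbf{x})) : \mathbf{b}(\mathbf{x}) = f(\mathbf{a}(\mathbf{x}))\}.$$

For example, the function $f: \mathbb{R} \setminus \{0\} \to \mathbb{R} \setminus \{0\}$ where $f(r) = r^{-1}$ induces a function
$f: \mathbb{F}^X \to \mathbb{F}^X$, where $f(\mathbf{a}) = \mathbf{b}$, and $\mathbf{b}(\mathbf{x}) = 1/\mathbf{a}(\mathbf{x})$ if $\mathbf{a}(\mathbf{x}) \neq 0$, otherwise $\mathbf{b}(\mathbf{x}) = 0$. The image $\mathbf{b}$
so described is denoted by $\mathbf{a}^{-1}$. Clearly $\mathbf{a} * \mathbf{a}^{-1} = \mathbf{1}$ does not hold for every $\mathbf{a}$, since the product is 0
wherever $\mathbf{a}$ is 0. But it is true that $\mathbf{a} * \mathbf{a}^{-1} * \mathbf{a} = \mathbf{a}$. For this reason $\mathbf{a}^{-1}$ is called the pseudo inverse of $\mathbf{a}$.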
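A short Python sketch may make the pseudo inverse concrete. The dictionary representation of an image and the function names are assumptions chosen for illustration only.

```python
# Illustrative sketch of the pseudo inverse of a real valued image.
# An image is modelled as a dict {pixel location: gray value}.

def pseudo_inverse(a):
    """b(x) = 1/a(x) where a(x) is nonzero, and b(x) = 0 elsewhere."""
    return {x: (1.0 / v if v != 0 else 0.0) for x, v in a.items()}

def times(a, b):
    """Pointwise product a * b."""
    return {x: a[x] * b[x] for x in a}

a = {0: 2.0, 1: 0.0, 2: -4.0}
a_inv = pseudo_inverse(a)
product = times(a, a_inv)      # 1 where a is nonzero, 0 where a is zero
recovered = times(product, a)  # a * a^{-1} * a
```

In this example `product` is not the unit image because `a` vanishes at one pixel, while `recovered` equals `a`.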




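If the value set $\mathbb{F} = \mathbb{R}_{\pm\infty}$, then the additive dual of $\mathbf{a} \in \mathbb{R}_{\pm\infty}^X$ is denoted by $\mathbf{a}^*$ and
defined by

$$\mathbf{a}^*(\mathbf{x}) = \begin{cases} -\mathbf{a}(\mathbf{x}) & \text{if } \mathbf{a}(\mathbf{x}) \in \mathbb{R} \\ -\infty & \text{if } \mathbf{a}(\mathbf{x}) = +\infty \\ +\infty & \text{if } \mathbf{a}(\mathbf{x}) = -\infty. \end{cases}$$

Thus we have $(\mathbf{a}^*)^* = \mathbf{a}$.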


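If $\mathbb{F} = \mathbb{R}^+_{\pm\infty}$, then the multiplicative dual of $\mathbf{a} \in (\mathbb{R}^+_{\pm\infty})^X$ is denoted by $\bar{\mathbf{a}}$ and defined by

$$\bar{\mathbf{a}}(\mathbf{x}) = \begin{cases} 1/\mathbf{a}(\mathbf{x}) & \text{if } \mathbf{a}(\mathbf{x}) \in \mathbb{R}^+ \\ -\infty & \text{if } \mathbf{a}(\mathbf{x}) = +\infty \\ +\infty & \text{if } \mathbf{a}(\mathbf{x}) = -\infty. \end{cases}$$

It follows that $\bar{\bar{\mathbf{a}}} = \mathbf{a}$.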


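Another useful induced function is the characteristic function. Let $\chi_T$ denote the usual
characteristic function with respect to an arbitrary set $T$. Here

$$\chi_T(x) = \begin{cases} 1 & \text{if } x \in T \\ 0 & \text{if } x \notin T. \end{cases}$$

We now define the generalized characteristic function of an image $\mathbf{a} \in \mathbb{R}^X$. Let $\mathbf{a} \in \mathbb{R}^X$ and
$S \in (2^{\mathbb{R}})^X$. Then the generalized characteristic function of an image $\mathbf{a}$ is defined as

$$\chi_S(\mathbf{a}) = \mathbf{c} \in \mathbb{R}^X$$

where

$$\mathbf{c} = \left\{ (\mathbf{x}, \mathbf{c}(\mathbf{x})) : \mathbf{c}(\mathbf{x}) = \begin{cases} 1 & \text{if } \mathbf{a}(\mathbf{x}) \in S(\mathbf{x}) \\ 0 & \text{otherwise} \end{cases} \right\}.$$

Note that the usual characteristic function above is a special case of the generalized
characteristic function, where $T \subset \mathbb{F}$ and $S(\mathbf{x}) = T$ for all $\mathbf{x} \in X$. The typical thresholding
function applied to an image is a simple example of the generalized characteristic function. Fix
$\mathbf{b} \in \mathbb{R}^X$. Then $S_{<\mathbf{b}} \in (2^{\mathbb{R}})^X$ is defined by

$$S_{<\mathbf{b}}(\mathbf{x}) = \{ r \in \mathbb{R} : r < \mathbf{b}(\mathbf{x}) \}.$$

To simplify notation, we define $\chi_{<\mathbf{b}} \equiv \chi_{S_{<\mathbf{b}}}$. Thus, for $\mathbf{b}, \mathbf{a} \in \mathbb{F}^X$, we have

$$\chi_{<\mathbf{b}}(\mathbf{a}) = \left\{ (\mathbf{x}, \mathbf{c}(\mathbf{x})) : \mathbf{c}(\mathbf{x}) = \begin{cases} 1 & \text{if } \mathbf{a}(\mathbf{x}) < \mathbf{b}(\mathbf{x}) \\ 0 & \text{otherwise} \end{cases} \right\}.$$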
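As a hedged illustration of the generalized characteristic function and the thresholding special case, the following Python sketch models each set $S(\mathbf{x})$ by a membership predicate. The dictionary representation and the names `chi` and `chi_less` are assumptions introduced here.

```python
# Illustrative sketch: images are dicts {pixel location: gray value};
# S maps each pixel location x to a predicate deciding membership in S(x).

def chi(a, S):
    """Generalized characteristic function: 1 where a(x) lies in S(x), else 0."""
    return {x: 1 if S[x](a[x]) else 0 for x in a}

def chi_less(a, b):
    """Thresholding against the image b, using S_{<b}(x) = {r : r < b(x)}."""
    S = {x: (lambda r, bound=b[x]: r < bound) for x in b}
    return chi(a, S)

a = {0: 3.0, 1: 7.0, 2: 5.0}
b = {0: 5.0, 1: 5.0, 2: 5.0}   # constant threshold image
mask = chi_less(a, b)
```

Because the threshold is itself an image, a spatially varying threshold needs no extra machinery: only the values of `b` change.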


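If we now consider the characteristic function on $\mathbb{R}_{\pm\infty}^X$, we find that we would like our
binary output image to have the values $-\infty$ and $0$ instead of 0's and 1's, respectively. We
define the extended characteristic function as the function induced by

$$\chi_S^{-\infty}(x) = \begin{cases} 0 & \text{if } x \in S \\ -\infty & \text{otherwise.} \end{cases}$$

Thus, $\chi_{<\mathbf{b}}^{-\infty}(\mathbf{a})$ is defined as

$$\chi_{<\mathbf{b}}^{-\infty}(\mathbf{a}) = \left\{ (\mathbf{x}, \mathbf{c}(\mathbf{x})) : \mathbf{c}(\mathbf{x}) = \begin{cases} 0 & \text{if } \mathbf{a}(\mathbf{x}) < \mathbf{b}(\mathbf{x}) \\ -\infty & \text{otherwise} \end{cases} \right\}.$$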


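One unary operation on images is the sum operation, which we will use in Chapter 7. Let $X$
be a finite coordinate set. Then the sum of $\mathbf{a} \in \mathbb{R}^X$ is defined to be

$$\sum \mathbf{a} = \mathbf{a} \bullet \mathbf{1} = \sum_{\mathbf{x} \in X} \mathbf{a}(\mathbf{x}).$$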

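In the context of the lattice structures of $\mathbb{R}_{\pm\infty}$ and $\mathbb{R}^+_{\pm\infty}$ we make the following definition.
Let $\mathbf{a} \in \mathbb{R}_{\pm\infty}^X$. The maximum of $\mathbf{a}$ is the scalar determined by

$$\bigvee \mathbf{a} = \bigvee_{\mathbf{x} \in X} \mathbf{a}(\mathbf{x}).$$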


1.1.3. Generalized Templates

For a generalized template $\mathbf{t} \in (\mathbb{F}^X)^Y$, the coordinate set $Y$ is called the target
domain or the domain of $\mathbf{t}$, and $X$ is called the range space of $\mathbf{t}$. The pixel location $\mathbf{y} \in Y$
at which a template $\mathbf{t}_{\mathbf{y}}$ is evaluated is called a target point of the template $\mathbf{t}$, and the values
$\mathbf{t}_{\mathbf{y}}(\mathbf{x})$ are called the weights of the template $\mathbf{t}$ at $\mathbf{y}$.




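If $\mathbb{F} \in \{\mathbb{R}, \mathbb{C}\}$, then for $\mathbf{t} \in (\mathbb{F}^X)^Y$, the set

$$S(\mathbf{t}_{\mathbf{y}}) = \{\mathbf{x} \in X : \mathbf{t}_{\mathbf{y}}(\mathbf{x}) \neq 0\}$$

is called the support of $\mathbf{t}_{\mathbf{y}}$. If $\mathbb{F} \in \{\mathbb{R}_{\pm\infty}, \mathbb{R}^+_{\pm\infty}\}$, then for $\mathbf{t} \in (\mathbb{F}^X)^Y$ we define

$$S_{-\infty}(\mathbf{t}_{\mathbf{y}}) = \{\mathbf{x} \in X : \mathbf{t}_{\mathbf{y}}(\mathbf{x}) \neq -\infty\}$$

and

$$S_{+\infty}(\mathbf{t}_{\mathbf{y}}) = \{\mathbf{x} \in X : \mathbf{t}_{\mathbf{y}}(\mathbf{x}) \neq +\infty\}$$

to be the (negative) infinite support and the positive infinite support, respectively.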

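If $\mathbf{t} \in (\mathbb{F}^X)^X$ and for all triples $\mathbf{x}, \mathbf{y}, \mathbf{z} \in X$ with $\mathbf{y} + \mathbf{z}$ and $\mathbf{x} + \mathbf{z} \in X$, we have
$\mathbf{t}_{\mathbf{y}}(\mathbf{x}) = \mathbf{t}_{\mathbf{y}+\mathbf{z}}(\mathbf{x} + \mathbf{z})$, then $\mathbf{t}$ is called translation invariant. A template which is not
translation invariant is called translation variant, or simply variant. Translation invariant
templates have the nice property that they may be represented pictorially in a concise manner.
The following translation invariant template is presented pictorially in Figure 4. Let
$X = Y = \mathbb{Z}^2$, and let $\mathbf{y} = (i,j) \in \mathbb{Z}^2$. Let $\mathbf{x}_1 = (i,j)$, $\mathbf{x}_2 = (i+1,j)$, $\mathbf{x}_3 = (i,j-1)$, and
$\mathbf{x}_4 = (i+1,j-1)$. Define the weights by

$$\mathbf{t}_{\mathbf{y}}(\mathbf{x}) = \begin{cases} k & \text{if } \mathbf{x} = \mathbf{x}_k,\ k = 1, \ldots, 4 \\ 0 & \text{otherwise.} \end{cases}$$

Then

$$S(\mathbf{t}_{\mathbf{y}}) = \{\mathbf{x}_1, \ldots, \mathbf{x}_4\}.$$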


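[Figure 4: pictorial representation of $\mathbf{t}$, a block of four cells in columns $i$ and $i+1$ carrying the weights 1, 2, 3, 4, with the target point $\mathbf{y}$ marked.]

Figure 4. Example of a Translation Invariant Template.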




The  cell  with  the  hash  marks  in  the  pictorial  representation  of  t  indicates  the  location  of  the 
target  point  y. 
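A translation invariant template is fully described by its weights relative to the target point, which is why a single picture suffices. The Python sketch below stores the Figure 4 example as a table of offsets; the names `OFFSETS`, `t` and `weight` are illustrative assumptions, as is the choice to store only the support.

```python
# Illustrative sketch of the translation invariant template of Figure 4.
# Weights are stored relative to the target point y = (i, j).
OFFSETS = {(0, 0): 1, (1, 0): 2, (0, -1): 3, (1, -1): 4}

def t(y):
    """Return the image t_y restricted to its support, as a dict x -> weight."""
    i, j = y
    return {(i + di, j + dj): w for (di, dj), w in OFFSETS.items()}

def weight(y, x):
    """Weight t_y(x); points outside the support have weight 0."""
    return t(y).get(x, 0)

def shift(p, z):
    """Translate the point p by the vector z."""
    return (p[0] + z[0], p[1] + z[1])

y, z = (2, 3), (5, -1)
invariant = all(
    weight(y, x) == weight(shift(y, z), shift(x, z)) for x in t(y)
)
```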

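There are several representations of a template that we will be concerned with. One is
the transpose of a template. Let $\mathbf{t} \in (\mathbb{F}^X)^Y$. Then the transpose of $\mathbf{t}$ is a template
$\mathbf{t}' \in (\mathbb{F}^Y)^X$ defined by $\mathbf{t}'_{\mathbf{x}}(\mathbf{y}) = \mathbf{t}_{\mathbf{y}}(\mathbf{x})$. If $\mathbb{F} \in \{\mathbb{R}_{\pm\infty}, \mathbb{R}^+_{\pm\infty}\}$, then we can introduce a dual
template. For $\mathbf{t} \in (\mathbb{R}_{\pm\infty}^X)^Y$, the additive dual of $\mathbf{t}$ is the template $\mathbf{t}^* \in (\mathbb{R}_{\pm\infty}^Y)^X$ defined by

$$\mathbf{t}^*_{\mathbf{x}}(\mathbf{y}) = \begin{cases} -\mathbf{t}_{\mathbf{y}}(\mathbf{x}) & \text{if } \mathbf{t}_{\mathbf{y}}(\mathbf{x}) \in \mathbb{R} \\ -\infty & \text{if } \mathbf{t}_{\mathbf{y}}(\mathbf{x}) = +\infty \\ +\infty & \text{if } \mathbf{t}_{\mathbf{y}}(\mathbf{x}) = -\infty. \end{cases}$$

Similarly, if $\mathbf{t} \in ((\mathbb{R}^+_{\pm\infty})^X)^Y$, the multiplicative dual of $\mathbf{t}$ is the template
$\bar{\mathbf{t}} \in ((\mathbb{R}^+_{\pm\infty})^Y)^X$ defined by

$$\bar{\mathbf{t}}_{\mathbf{x}}(\mathbf{y}) = \begin{cases} 1/\mathbf{t}_{\mathbf{y}}(\mathbf{x}) & \text{if } \mathbf{t}_{\mathbf{y}}(\mathbf{x}) \in \mathbb{R}^+ \\ -\infty & \text{if } \mathbf{t}_{\mathbf{y}}(\mathbf{x}) = +\infty \\ +\infty & \text{if } \mathbf{t}_{\mathbf{y}}(\mathbf{x}) = -\infty. \end{cases}$$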


1.1.4. Operations Between Images and Templates

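One common use of templates is to describe some transformation of an input image
based on its image values within a subset of the coordinate set $X$. We first introduce the
generalized product between an image and a template. Let $X \subset \mathbb{R}^n$ be finite,
$X = \{\mathbf{x}_1, \ldots, \mathbf{x}_m\}$. Let $\gamma$ be an associative binary operation on the value set $\mathbb{F}$. Then the
global reduce operation $\Gamma$ on $\mathbb{F}^X$ induced by $\gamma$ is defined by

$$\Gamma(\mathbf{a}) = \mathop{\Gamma}_{\mathbf{x} \in X} \mathbf{a}(\mathbf{x}) = \mathbf{a}(\mathbf{x}_1)\, \gamma\, \mathbf{a}(\mathbf{x}_2)\, \gamma \cdots \gamma\, \mathbf{a}(\mathbf{x}_m),$$

where $\mathbf{a} \in \mathbb{F}^X$. Thus, $\Gamma: \mathbb{F}^X \to \mathbb{F}$.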
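The global reduce operation is a fold of the associative operation $\gamma$ over the pixel values. A minimal Python sketch, assuming images are stored as dictionaries and using the hypothetical helper name `global_reduce`:

```python
from functools import reduce
import operator

def global_reduce(gamma, a):
    """Fold the associative binary operation gamma over all pixel values of a."""
    return reduce(gamma, a.values())

a = {0: 3, 1: -1, 2: 4}
total = global_reduce(operator.add, a)   # gamma = +  gives the image sum
largest = global_reduce(max, a)          # gamma = v  gives the image maximum
smallest = global_reduce(min, a)         # gamma = ^  gives the image minimum
```

Associativity of $\gamma$ is what makes the result independent of how the terms are grouped.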




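Images and templates are combined by combining appropriate binary operations. Let
$\mathbb{F}_1$, $\mathbb{F}_2$, and $\mathbb{F}$ be three value sets, and suppose $\circ: \mathbb{F}_1 \times \mathbb{F}_2 \to \mathbb{F}$ and $\tilde{\circ}: \mathbb{F}_2 \times \mathbb{F}_1 \to \mathbb{F}$ are
binary operations. If $\gamma$ is an associative binary operation on $\mathbb{F}$, $\mathbf{a} \in \mathbb{F}_1^X$, and $\mathbf{t} \in (\mathbb{F}_2^X)^Y$, then
the generalized backward template operation of $\mathbf{a}$ with $\mathbf{t}$ (induced by $\gamma$ and $\circ$) is the binary
operation $\mathbin{\text{\textcircled{$\gamma$}}}: \mathbb{F}_1^X \times (\mathbb{F}_2^X)^Y \to \mathbb{F}^Y$ defined by

$$\mathbf{a} \mathbin{\text{\textcircled{$\gamma$}}} \mathbf{t} = \Big\{ (\mathbf{y}, \mathbf{b}(\mathbf{y})) : \mathbf{b}(\mathbf{y}) = \mathop{\Gamma}_{\mathbf{x} \in X} \mathbf{a}(\mathbf{x}) \circ \mathbf{t}_{\mathbf{y}}(\mathbf{x}),\ \mathbf{y} \in Y \Big\}.$$

If $\mathbf{t} \in (\mathbb{F}_2^Y)^X$, then the generalized forward template operation of $\mathbf{a}$ with $\mathbf{t}$ is defined as

$$\mathbf{t} \mathbin{\text{\textcircled{$\gamma$}}} \mathbf{a} = \Big\{ (\mathbf{y}, \mathbf{b}(\mathbf{y})) : \mathbf{b}(\mathbf{y}) = \mathop{\Gamma}_{\mathbf{x} \in X} \mathbf{t}_{\mathbf{x}}(\mathbf{y})\, \tilde{\circ}\, \mathbf{a}(\mathbf{x}),\ \mathbf{y} \in Y \Big\}.$$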

Note that the input image $\mathbf{a}$ is an $\mathbb{F}_1$ valued image on the coordinate set $X$, and the
output image $\mathbf{b}$ is an $\mathbb{F}$ valued image on the coordinate set $Y$, regardless of which template
operation,  forward  or  backward,  is  used.  Templates  can  therefore  be  used  to  transform  an 
image  on  one  coordinate  set  and  with  values  in  one  set  to  an  image  on  a  completely  different 
coordinate  set  whose  values  may  be  entirely  different  from  the  original  image’s. 

Only three special cases of the above generalized operation have been investigated in
detail, one by Gader [32] and the other two in this dissertation. Future research will
certainly discover other useful combinations. These three operations are denoted by $\oplus$, $\mathbin{\boxed{\vee}}$, and
$\mathbin{\text{\textcircled{$\vee$}}}$. The operation $\oplus$ is a linear one, and we refer the interested reader to other references
for recent research in this area [32,43,44]. The other two operations, $\mathbin{\boxed{\vee}}$ and $\mathbin{\text{\textcircled{$\vee$}}}$, are
non-linear, and investigation of the structure they induce on images and templates is the focus of
this dissertation.

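Since our main interest concerns the operations $\mathbin{\boxed{\vee}}$ and $\mathbin{\text{\textcircled{$\vee$}}}$, we will omit the definition of
$\oplus$ and refer the interested reader to another reference [31]. Let $X \subset \mathbb{R}^n$ be finite and
$Y \subset \mathbb{R}^m$. Let $\mathbf{a} \in \mathbb{R}_{-\infty}^X$ and $\mathbf{t} \in (\mathbb{R}_{-\infty}^X)^Y$. Then the backward additive max is defined as

$$\mathbf{a} \mathbin{\boxed{\vee}} \mathbf{t} = \Big\{ (\mathbf{y}, \mathbf{b}(\mathbf{y})) : \mathbf{b}(\mathbf{y}) = \bigvee_{\mathbf{x} \in X} \mathbf{a}(\mathbf{x}) + \mathbf{t}_{\mathbf{y}}(\mathbf{x}),\ \mathbf{y} \in Y \Big\},$$

where $\bigvee_{\mathbf{x} \in X} \mathbf{a}(\mathbf{x}) + \mathbf{t}_{\mathbf{y}}(\mathbf{x}) = \max\{ \mathbf{a}(\mathbf{x}) + \mathbf{t}_{\mathbf{y}}(\mathbf{x}) : \mathbf{x} \in X \}$.

For $\mathbf{t} \in (\mathbb{R}_{-\infty}^Y)^X$ we define the forward additive max transform by

$$\mathbf{t} \mathbin{\boxed{\vee}} \mathbf{a} = \Big\{ (\mathbf{y}, \mathbf{b}(\mathbf{y})) : \mathbf{b}(\mathbf{y}) = \bigvee_{\mathbf{x} \in X} \mathbf{a}(\mathbf{x}) + \mathbf{t}_{\mathbf{x}}(\mathbf{y}),\ \mathbf{y} \in Y \Big\}.$$

We use the usual extended arithmetic addition $r + (-\infty) = (-\infty) + r = -\infty$ for all $r \in \mathbb{R}_{-\infty}$ to
define $\mathbf{a}(\mathbf{x}) + \mathbf{t}_{\mathbf{y}}(\mathbf{x})$ everywhere.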
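The backward additive max can be sketched directly from its definition. In the Python below, an image is a dictionary, a template is a function returning the dictionary of weights of $\mathbf{t}_{\mathbf{y}}$, and absent weights are read as $-\infty$; these representation choices and the names `additive_max` and `window` are assumptions for illustration.

```python
NEG_INF = float("-inf")

def additive_max(a, t, Y):
    """b(y) = max over x in X of a(x) + t_y(x).

    a: dict x -> value in R or -inf; t(y): dict x -> weight, -inf if absent.
    Python floats already satisfy r + (-inf) = -inf for finite r and for -inf.
    """
    b = {}
    for y in Y:
        ty = t(y)
        b[y] = max(a[x] + ty.get(x, NEG_INF) for x in a)
    return b

def window(y):
    """Template with weight 0 at y-1, y, y+1 (a one-dimensional moving maximum)."""
    return {y - 1: 0.0, y: 0.0, y + 1: 0.0}

a = {0: 0.0, 1: 1.0, 2: 5.0, 3: 2.0, 4: 0.0}
b = additive_max(a, window, range(5))
```

With all weights equal to 0 on a small neighbourhood, the transform reduces to a local maximum filter, which is the familiar gray-scale dilation by a flat structuring element.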

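For $\mathbf{a} \in (\mathbb{R}^+_{-\infty})^X$ and $\mathbf{t} \in ((\mathbb{R}^+_{-\infty})^X)^Y$ we define the backward multiplicative max
transform

$$\mathbf{a} \mathbin{\text{\textcircled{$\vee$}}} \mathbf{t} = \Big\{ (\mathbf{y}, \mathbf{b}(\mathbf{y})) : \mathbf{b}(\mathbf{y}) = \bigvee_{\mathbf{x} \in X} \mathbf{a}(\mathbf{x}) * \mathbf{t}_{\mathbf{y}}(\mathbf{x}),\ \mathbf{y} \in Y \Big\}.$$

The forward multiplicative max transform is given by

$$\mathbf{t} \mathbin{\text{\textcircled{$\vee$}}} \mathbf{a} = \Big\{ (\mathbf{y}, \mathbf{b}(\mathbf{y})) : \mathbf{b}(\mathbf{y}) = \bigvee_{\mathbf{x} \in X} \mathbf{a}(\mathbf{x}) * \mathbf{t}_{\mathbf{x}}(\mathbf{y}),\ \mathbf{y} \in Y \Big\},$$

where $\mathbf{t} \in ((\mathbb{R}^+_{-\infty})^Y)^X$.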

Recall that a lattice-ordered group, or l-group, is a group which is also a lattice. The
operation addition (multiplication) on the l-group $\mathbb{R}$ ($\mathbb{R}^+$) can be extended in a well-defined
manner to addition (multiplication) on $\mathbb{R}_{\pm\infty}$ ($\mathbb{R}^+_{\pm\infty}$) by defining

$$\begin{aligned}
x \times (-\infty) &= (-\infty) \times x = -\infty, & x &\in G \cup \{-\infty\} \\
x \times (+\infty) &= (+\infty) \times x = +\infty, & x &\in G \cup \{+\infty\} \\
(-\infty) \times (+\infty) &= (+\infty) \times (-\infty) = -\infty
\end{aligned}$$

where $\times \in \{+, *\}$, depending on whether $G = \mathbb{R}$ or $\mathbb{R}^+$, respectively. Of course, the
elements $+\infty$, $-\infty$ have no inverse under the operation $+$ or $*$, and hence $\mathbb{R}_{\pm\infty}$ (or
$\mathbb{R}^+_{\pm\infty}$) is no longer a group. This is discussed in detail in section 1.2, where the notion of a
bounded lattice ordered group, an extension of a lattice-ordered group with extended
arithmetic, is introduced. This provides for the value set $\mathbb{R}_{\pm\infty}$ to be used in the definition of
the image-template operation $\mathbin{\boxed{\vee}}$, for example, and the value set $\mathbb{R}^+_{\pm\infty}$ to be used in the
definition of the image-template operation $\mathbin{\text{\textcircled{$\vee$}}}$.
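The extended arithmetic differs from IEEE floating point in one case: in Python, `inf + (-inf)` evaluates to NaN, whereas the rules above define it as $-\infty$. The following sketch, with the hypothetical helper names `ext_add` and `ext_mul`, shows one way the rules might be implemented.

```python
INF = float("inf")

def ext_add(x, y):
    """Extended addition on R with -inf and +inf adjoined.

    -inf absorbs everything, including +inf; the remaining cases agree
    with ordinary floating point addition.
    """
    if x == -INF or y == -INF:
        return -INF
    return x + y

def ext_mul(x, y):
    """Extended multiplication on the positive reals with -inf and +inf adjoined."""
    if x == -INF or y == -INF:
        return -INF
    return x * y

examples = [
    ext_add(3.0, -INF),   # finite plus -inf
    ext_add(INF, -INF),   # the mixed case, defined as -inf
    ext_add(INF, 2.0),    # +inf with a finite element
    ext_mul(2.0, INF),
    ext_mul(INF, -INF),
]
```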


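We remark that for computational as well as theoretical purposes, we can restate the
above two convolutions with the new pixel value calculated only over the support of the
template $\mathbf{t}$. If $S_{-\infty}(\mathbf{t}_{\mathbf{y}}) \neq \emptyset$, then $\bigvee_{\mathbf{x} \in X} \mathbf{a}(\mathbf{x}) + \mathbf{t}_{\mathbf{y}}(\mathbf{x}) = \bigvee_{\mathbf{x} \in S_{-\infty}(\mathbf{t}_{\mathbf{y}})} \mathbf{a}(\mathbf{x}) + \mathbf{t}_{\mathbf{y}}(\mathbf{x})$, and we have

$$\mathbf{a} \mathbin{\boxed{\vee}} \mathbf{t} = \Big\{ (\mathbf{y}, \mathbf{b}(\mathbf{y})) : \mathbf{b}(\mathbf{y}) = \bigvee_{\mathbf{x} \in S_{-\infty}(\mathbf{t}_{\mathbf{y}})} \mathbf{a}(\mathbf{x}) + \mathbf{t}_{\mathbf{y}}(\mathbf{x}),\ \mathbf{y} \in Y \Big\}.$$

Similarly, if $S_{-\infty}(\mathbf{t}_{\mathbf{y}}) \neq \emptyset$, then $\bigvee_{\mathbf{x} \in X} \mathbf{a}(\mathbf{x}) * \mathbf{t}_{\mathbf{y}}(\mathbf{x}) = \bigvee_{\mathbf{x} \in S_{-\infty}(\mathbf{t}_{\mathbf{y}})} \mathbf{a}(\mathbf{x}) * \mathbf{t}_{\mathbf{y}}(\mathbf{x})$, and

$$\mathbf{a} \mathbin{\text{\textcircled{$\vee$}}} \mathbf{t} = \Big\{ (\mathbf{y}, \mathbf{b}(\mathbf{y})) : \mathbf{b}(\mathbf{y}) = \bigvee_{\mathbf{x} \in S_{-\infty}(\mathbf{t}_{\mathbf{y}})} \mathbf{a}(\mathbf{x}) * \mathbf{t}_{\mathbf{y}}(\mathbf{x}),\ \mathbf{y} \in Y \Big\}.$$

If in either case $S_{-\infty}(\mathbf{t}_{\mathbf{y}}) = \emptyset$, then we define

$$\bigvee_{\mathbf{x} \in S_{-\infty}(\mathbf{t}_{\mathbf{y}})} \mathbf{a}(\mathbf{x}) + \mathbf{t}_{\mathbf{y}}(\mathbf{x}) = -\infty \quad \text{or} \quad \bigvee_{\mathbf{x} \in S_{-\infty}(\mathbf{t}_{\mathbf{y}})} \mathbf{a}(\mathbf{x}) * \mathbf{t}_{\mathbf{y}}(\mathbf{x}) = -\infty.$$

We may therefore restrict our computation of the new pixel value to the infinite support of
$\mathbf{t}_{\mathbf{y}}$. This becomes particularly important when considering mapping of transforms to certain
types of parallel architectures, as will be discussed in the introductory remarks to Part II
and Chapter 5.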
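The computational benefit of restricting to the infinite support can be illustrated as follows. The sketch compares a scan over all of $X$ with a scan over only the stored (finite) weights of $\mathbf{t}_{\mathbf{y}}$; the sparse dictionary representation and the function names are assumptions made for this example.

```python
NEG_INF = float("-inf")

def additive_max_full(a, t, Y):
    """Reference version: scan every x in X, reading absent weights as -inf."""
    return {y: max(a[x] + t(y).get(x, NEG_INF) for x in a) for y in Y}

def additive_max_on_support(a, t, Y):
    """Scan only the points of the infinite support of t_y that lie in X."""
    b = {}
    for y in Y:
        terms = [a[x] + w for x, w in t(y).items() if x in a and w != NEG_INF]
        b[y] = max(terms) if terms else NEG_INF   # empty support gives -inf
    return b

def t(y):
    """Sparse template: only two weights per target point are stored."""
    return {y: 0.0, y + 1: -1.0}

a = {0: 0.0, 1: 1.0, 2: 5.0, 3: 2.0}
Y = [0, 1, 2, 3, 10]          # the support of t_10 misses X entirely
full = additive_max_full(a, t, Y)
sparse = additive_max_on_support(a, t, Y)
```

The sparse version touches at most two pixels per target point regardless of the size of $X$, which is the property that matters when mapping the transform onto hardware with local connectivity.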


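Because of the duality inherent in the two structures $\mathbb{R}_{\pm\infty}$ and $\mathbb{R}^+_{\pm\infty}$, the operations $\mathbin{\boxed{\vee}}$
and $\mathbin{\text{\textcircled{$\vee$}}}$ induce dual image-template operations, called additive minimum and multiplicative
minimum, respectively. They are defined by

$$\mathbf{a} \mathbin{\boxed{\wedge}} \mathbf{t} = (\mathbf{t}^* \mathbin{\boxed{\vee}} \mathbf{a}^*)^*$$

and

$$\mathbf{a} \mathbin{\text{\textcircled{$\wedge$}}} \mathbf{t} = \overline{(\bar{\mathbf{t}} \mathbin{\text{\textcircled{$\vee$}}} \bar{\mathbf{a}})}.$$

Equivalently, we have

$$\begin{aligned}
\mathbf{a} \mathbin{\boxed{\wedge}} \mathbf{t} &= \Big\{ (\mathbf{y}, \mathbf{b}(\mathbf{y})) : \mathbf{b}(\mathbf{y}) = \bigwedge_{\mathbf{x} \in X} \mathbf{a}(\mathbf{x}) +' \mathbf{t}_{\mathbf{y}}(\mathbf{x}),\ \mathbf{y} \in Y \Big\} \\
&= \Big\{ (\mathbf{y}, \mathbf{b}(\mathbf{y})) : \mathbf{b}(\mathbf{y}) = \bigwedge_{\mathbf{x} \in S_{+\infty}(\mathbf{t}_{\mathbf{y}})} \mathbf{a}(\mathbf{x}) +' \mathbf{t}_{\mathbf{y}}(\mathbf{x}),\ \mathbf{y} \in Y \Big\}
\end{aligned}$$

and

$$\begin{aligned}
\mathbf{t} \mathbin{\text{\textcircled{$\wedge$}}} \mathbf{a} &= \Big\{ (\mathbf{y}, \mathbf{b}(\mathbf{y})) : \mathbf{b}(\mathbf{y}) = \bigwedge_{\mathbf{x} \in X} \mathbf{a}(\mathbf{x}) *' \mathbf{t}_{\mathbf{x}}(\mathbf{y}),\ \mathbf{y} \in Y \Big\} \\
&= \Big\{ (\mathbf{y}, \mathbf{b}(\mathbf{y})) : \mathbf{b}(\mathbf{y}) = \bigwedge_{\{\mathbf{x} \in X \,:\, \mathbf{t}_{\mathbf{x}}(\mathbf{y}) \neq +\infty\}} \mathbf{a}(\mathbf{x}) *' \mathbf{t}_{\mathbf{x}}(\mathbf{y}),\ \mathbf{y} \in Y \Big\},
\end{aligned}$$

where the dual operations $+'$ and $*'$ are presented in section 1.2. As before, if the positive infinite
support over which the minimum is taken is empty, we define

$$\bigwedge \mathbf{a}(\mathbf{x}) +' \mathbf{t}_{\mathbf{y}}(\mathbf{x}) = +\infty \quad \text{or} \quad \bigwedge \mathbf{a}(\mathbf{x}) *' \mathbf{t}_{\mathbf{x}}(\mathbf{y}) = +\infty.$$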

The above definitions assume that the support $S_{-\infty}(\mathbf{t}_{\mathbf{y}})$ is finite for each $\mathbf{y} \in Y$. We
may extend the above definitions to continuous functions $\mathbf{a}$ and $\mathbf{t}_{\mathbf{y}}$ on a compact set $X \subset \mathbb{R}^n$.
This is well-defined, as the sum or product of two continuous functions on a compact subset
of $\mathbb{R}^n$ is continuous and therefore always attains a maximum. Extending the basic properties of
the image algebra operations involving $\mathbin{\boxed{\vee}}$ and $\mathbin{\text{\textcircled{$\vee$}}}$ from the discrete case to the continuous
case should present little difficulty, and remains an open problem at this time.


1.1.5. Operations Between Generalized Templates

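The pointwise operations of the value set $\mathbb{F}$ can also be extended to operations
between templates. For example, if $\mathbb{F} = \mathbb{R}$, then we have

$$\begin{aligned}
\mathbf{s} + \mathbf{t} &= \mathbf{r}, \quad \text{where } \mathbf{r}_{\mathbf{y}} = \mathbf{s}_{\mathbf{y}} + \mathbf{t}_{\mathbf{y}} \\
\mathbf{s} * \mathbf{t} &= \mathbf{r}, \quad \text{where } \mathbf{r}_{\mathbf{y}} = \mathbf{s}_{\mathbf{y}} * \mathbf{t}_{\mathbf{y}} \\
\mathbf{s} \vee \mathbf{t} &= \mathbf{r}, \quad \text{where } \mathbf{r}_{\mathbf{y}} = \mathbf{s}_{\mathbf{y}} \vee \mathbf{t}_{\mathbf{y}}.
\end{aligned}$$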




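If $\mathbb{F} = \mathbb{R}_{\pm\infty}$, define $\mathbf{s} + \mathbf{t} = \mathbf{r}$, where

$$\mathbf{r}_{\mathbf{y}}(\mathbf{x}) = \begin{cases}
\mathbf{s}_{\mathbf{y}}(\mathbf{x}) + \mathbf{t}_{\mathbf{y}}(\mathbf{x}) & \text{if } \mathbf{x} \in S_{-\infty}(\mathbf{t}_{\mathbf{y}}) \cap S_{-\infty}(\mathbf{s}_{\mathbf{y}}) \\
\mathbf{s}_{\mathbf{y}}(\mathbf{x}) & \text{if } \mathbf{x} \in S_{-\infty}(\mathbf{s}_{\mathbf{y}}) \setminus S_{-\infty}(\mathbf{t}_{\mathbf{y}}) \\
\mathbf{t}_{\mathbf{y}}(\mathbf{x}) & \text{if } \mathbf{x} \in S_{-\infty}(\mathbf{t}_{\mathbf{y}}) \setminus S_{-\infty}(\mathbf{s}_{\mathbf{y}}) \\
-\infty & \text{otherwise.}
\end{cases}$$

Note that in the case where $\mathbf{s}$ and $\mathbf{t}$ have no values of $-\infty$ or $+\infty$ anywhere, the
definition of $\mathbf{s} + \mathbf{t}$ on the value set $\mathbb{R}_{\pm\infty}$ degenerates to the definition of $\mathbf{s} + \mathbf{t}$ on the value
set $\mathbb{R}$.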

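The generalized image-template operation $\mathbin{\text{\textcircled{$\gamma$}}}$ generalizes to a generalized
template-template product. Let $X \subset \mathbb{R}^n$ be finite, $X = \{\mathbf{x}_1, \ldots, \mathbf{x}_m\}$, and let $\gamma$ be an associative
binary operation on the value set $\mathbb{F}$ with global reduce operation $\Gamma$ on $\mathbb{F}^X$. Let
$\mathbb{F}_1$, $\mathbb{F}_2$, and $\mathbb{F}$ be three value sets, and suppose $\circ: \mathbb{F}_1 \times \mathbb{F}_2 \to \mathbb{F}$ is a commutative binary
operation. If $\gamma$ is an associative binary operation on $\mathbb{F}$, $\mathbf{t} \in (\mathbb{F}_1^X)^Y$, and $\mathbf{s} \in (\mathbb{F}_2^W)^X$, then
the generalized template operation of $\mathbf{t}$ with $\mathbf{s}$ (induced by $\gamma$ and $\circ$) is the binary operation
$\mathbin{\text{\textcircled{$\gamma$}}}: (\mathbb{F}_1^X)^Y \times (\mathbb{F}_2^W)^X \to (\mathbb{F}^W)^Y$ defined by

$$\mathbf{t} \mathbin{\text{\textcircled{$\gamma$}}} \mathbf{s} = \mathbf{r} \in (\mathbb{F}^W)^Y, \quad \text{where} \quad \mathbf{r}_{\mathbf{y}}(\mathbf{w}) = \mathop{\Gamma}_{\mathbf{x} \in X} \mathbf{t}_{\mathbf{y}}(\mathbf{x}) \circ \mathbf{s}_{\mathbf{x}}(\mathbf{w}), \quad \mathbf{y} \in Y,\ \mathbf{w} \in W.$$

Note that if $|Y| = 1$, then the definition of the generalized template operation of $\mathbf{t}$ and $\mathbf{s}$
degenerates to the definition of the generalized backward template operation of the image
$\mathbf{t} \in \mathbb{F}_1^X$ with the template $\mathbf{s} \in (\mathbb{F}_2^W)^X$, and $\mathbf{r} \in \mathbb{F}^W$. If $|W| = 1$, then the definition of the
generalized template operation of $\mathbf{t}$ and $\mathbf{s}$ degenerates to the definition of the forward
template operation of the image $\mathbf{s} \in \mathbb{F}_2^X$ with the template $\mathbf{t} \in (\mathbb{F}_1^X)^Y$, where $\mathbf{r} \in \mathbb{F}^Y$.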

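The specific cases $\oplus$, $\mathbin{\boxed{\vee}}$, and $\mathbin{\text{\textcircled{$\vee$}}}$ thus generalize to operations between
templates. We give the definitions for $\mathbin{\boxed{\vee}}$ and $\mathbin{\text{\textcircled{$\vee$}}}$, and refer the reader to another reference for
the definition of $\oplus$ [32]. Let $\mathbf{t} \in (\mathbb{R}_{-\infty}^X)^Y$ and $\mathbf{s} \in (\mathbb{R}_{-\infty}^W)^X$. Then $\mathbf{s} \mathbin{\boxed{\vee}} \mathbf{t} = \mathbf{r} \in (\mathbb{R}_{-\infty}^W)^Y$ is
defined by

$$\mathbf{r}_{\mathbf{y}}(\mathbf{w}) = \bigvee_{\mathbf{x} \in X} \mathbf{t}_{\mathbf{y}}(\mathbf{x}) + \mathbf{s}_{\mathbf{x}}(\mathbf{w}), \quad \text{where } \mathbf{w} \in W.$$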

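Again, as in the image-template operations, we may restrict our computation to a subset of
$X$. In particular, for $\mathbf{y} \in Y$, we define the set

$$S_{-\infty}(\mathbf{w}) = \{\mathbf{x} \in X : \mathbf{x} \in S_{-\infty}(\mathbf{t}_{\mathbf{y}}) \text{ and } \mathbf{w} \in S_{-\infty}(\mathbf{s}_{\mathbf{x}})\}.$$

Then $\mathbf{r} = \mathbf{s} \mathbin{\boxed{\vee}} \mathbf{t} \in (\mathbb{R}_{-\infty}^W)^Y$ is defined by

$$\mathbf{r}_{\mathbf{y}}(\mathbf{w}) = \bigvee_{\mathbf{x} \in S_{-\infty}(\mathbf{w})} \mathbf{t}_{\mathbf{y}}(\mathbf{x}) + \mathbf{s}_{\mathbf{x}}(\mathbf{w}),$$

where we define $\bigvee_{\mathbf{x} \in S_{-\infty}(\mathbf{w})} \mathbf{t}_{\mathbf{y}}(\mathbf{x}) + \mathbf{s}_{\mathbf{x}}(\mathbf{w}) = -\infty$ whenever $S_{-\infty}(\mathbf{w}) = \emptyset$.

The operation $\mathbin{\text{\textcircled{$\vee$}}}$ has a similar situation. We have $\mathbf{r} = \mathbf{s} \mathbin{\text{\textcircled{$\vee$}}} \mathbf{t} \in ((\mathbb{R}^+_{-\infty})^W)^Y$, which is
defined by

$$\mathbf{r}_{\mathbf{y}}(\mathbf{w}) = \bigvee_{\mathbf{x} \in S_{-\infty}(\mathbf{w})} \mathbf{t}_{\mathbf{y}}(\mathbf{x}) * \mathbf{s}_{\mathbf{x}}(\mathbf{w}),$$

where we define $\bigvee_{\mathbf{x} \in S_{-\infty}(\mathbf{w})} \mathbf{t}_{\mathbf{y}}(\mathbf{x}) * \mathbf{s}_{\mathbf{x}}(\mathbf{w}) = -\infty$ whenever $S_{-\infty}(\mathbf{w}) = \emptyset$.

It follows from these definitions that the infinite support of the template $\mathbf{r}$ is
$S_{-\infty}(\mathbf{r}_{\mathbf{y}}) = \{\mathbf{w} \in W : S_{-\infty}(\mathbf{w}) \neq \emptyset\}$.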
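The template-template additive max restricted to supports can be sketched as a double loop over the stored weights. As before, templates are modelled as functions returning sparse dictionaries, which is an assumption of this illustration, as are the names `compose_additive_max`, `t` and `s`.

```python
NEG_INF = float("-inf")

def compose_additive_max(t, s, Y):
    """Return r with r[y][w] = max over x of t_y(x) + s_x(w).

    Only pairs with x in the support of t_y and w in the support of s_x
    are visited, so r[y] is stored on its infinite support only.
    """
    r = {}
    for y in Y:
        ry = {}
        for x, tw in t(y).items():
            for w, sw in s(x).items():
                ry[w] = max(ry.get(w, NEG_INF), tw + sw)
        r[y] = ry
    return r

def t(y):
    return {y: 0.0, y + 1: 1.0}

def s(x):
    return {x: 0.0, x + 1: 2.0}

r = compose_additive_max(t, s, [0])
```

In this example the support of `r[0]` is `{0, 1, 2}`, the set of points `w` reachable through at least one `x`, in line with the description of the infinite support of the product template.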

The  definitions  given  in  this  section  are  the  elemental  ones.  Further  definitions  that 
play  important  parts  in  the  theoretical  development  of  the  lattice  structure  of  the  image 
algebra  will  be  presented  as  needed. 

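We define the complementary operations $\mathbin{\boxed{\wedge}}$ and $\mathbin{\text{\textcircled{$\wedge$}}}$ for templates in the natural
way. Let $\mathbf{t} \in (\mathbb{R}_{\pm\infty}^X)^Y$ and $\mathbf{s} \in (\mathbb{R}_{\pm\infty}^W)^X$. Then $\mathbf{s} \mathbin{\boxed{\wedge}} \mathbf{t} \in (\mathbb{R}_{\pm\infty}^W)^Y$ is defined by

$$\mathbf{s} \mathbin{\boxed{\wedge}} \mathbf{t} = (\mathbf{t}^* \mathbin{\boxed{\vee}} \mathbf{s}^*)^*.$$

Similarly, for $\mathbf{t} \in ((\mathbb{R}^+_{\pm\infty})^X)^Y$ and $\mathbf{s} \in ((\mathbb{R}^+_{\pm\infty})^W)^X$, $\mathbf{s} \mathbin{\text{\textcircled{$\wedge$}}} \mathbf{t} \in ((\mathbb{R}^+_{\pm\infty})^W)^Y$ is defined by

$$\mathbf{s} \mathbin{\text{\textcircled{$\wedge$}}} \mathbf{t} = \overline{(\bar{\mathbf{t}} \mathbin{\text{\textcircled{$\vee$}}} \bar{\mathbf{s}})}.$$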

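We would like to remark upon one notational deviation between the Overview's [32]
definition for the $\mathbin{\text{\textcircled{$\vee$}}}$ operations and the one presented here. Let
$\mathbb{R}^{\geq 0}_{\infty} = \{r \in \mathbb{R} : r \geq 0\} \cup \{+\infty\}$. In the Overview, for $\mathbf{a} \in (\mathbb{R}^{\geq 0}_{\infty})^X$ and $\mathbf{t} \in ((\mathbb{R}^{\geq 0}_{\infty})^X)^Y$, the
backward multiplicative max transform is defined as

$$\mathbf{a} \mathbin{\text{\textcircled{$\vee$}}} \mathbf{t} = \Big\{ (\mathbf{y}, \mathbf{b}(\mathbf{y})) : \mathbf{b}(\mathbf{y}) = \bigvee_{\mathbf{x} \in X} \mathbf{a}(\mathbf{x}) \cdot \mathbf{t}_{\mathbf{y}}(\mathbf{x}),\ \mathbf{y} \in Y \Big\}$$

which is equivalent to

$$\mathbf{a} \mathbin{\text{\textcircled{$\vee$}}} \mathbf{t} = \Big\{ (\mathbf{y}, \mathbf{b}(\mathbf{y})) : \mathbf{b}(\mathbf{y}) = \bigvee_{\mathbf{x} \in S(\mathbf{t}_{\mathbf{y}})} \mathbf{a}(\mathbf{x}) \cdot \mathbf{t}_{\mathbf{y}}(\mathbf{x}),\ \mathbf{y} \in Y \Big\}$$

with $\mathbf{b}(\mathbf{y}) = 0$ if $S(\mathbf{t}_{\mathbf{y}}) = \emptyset$. The difference between the definition given earlier and this one
is the value set, namely $\mathbb{R}^+_{\pm\infty}$ in this document and $\mathbb{R}^{\geq 0}_{\infty}$ in the Overview. The number 0 acts
as a lower bound in $\mathbb{R}^{\geq 0}_{\infty}$ exactly as $-\infty$ acts as a lower bound in $\mathbb{R}^+_{\pm\infty}$. Multiplication of the
element 0 with the element $\infty$ follows the same rules as multiplication of the element $-\infty$
with the element $\infty$ as given on page 18. In other words, the element 0 can replace
symbolically the element $-\infty$. The main advantage of using the number 0 instead of the symbol
$-\infty$ is ease of machine and software implementation. Most real image processing data
will have no values corresponding to the symbol $+\infty$, and quite often have non-negative
values, including 0's. Using 0 as the bottom element enables that value to be represented
easily in the computer, while special programming methods would have to be considered to
represent the symbol $-\infty$. For purposes which will become clear in the course of this
document, we have remained with the notation $\mathbb{R}^+_{\pm\infty}$. In implementing any of the ideas in this
dissertation, if the value set at hand is $\mathbb{R}^+_{\pm\infty}$, it should be clear that the symbol $-\infty$ can be
replaced with a 0 and $S_{-\infty}(\mathbf{t}_{\mathbf{y}})$ replaced by $S(\mathbf{t}_{\mathbf{y}})$, so that representation in computers may
be more easily accomplished.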




1.2. Minimax Algebra

The last 40 years have seen a number of different authors discover, apparently
independently, a non-linear algebraic structure, which each has used to solve a different type of
problem. The operands of this algebra are the real numbers with $-\infty$ (or $+\infty$) adjoined, with
the two binary operations of addition and maximum (or minimum). The extension of this
structure to matrices was formalized mathematically, in the environment in which the above
problems were posed, by Cuninghame-Green in his book Minimax Algebra [39]. It is well
known that the structure of $\mathbb{R}$ with the operations of $+$ and $\vee$ is a semi-lattice ordered
group, and that $(\mathbb{R}, \vee, \wedge, +)$ is a lattice-ordered group, or an l-group [36]. Viewing $\mathbb{R}_{-\infty}$ as
a set with the two binary operations of $+$ and $\vee$, and then investigating the structure of the
set of all $n \times n$ matrices with values in $\mathbb{R}_{-\infty}$, leads to an entirely different perspective of a
class of non-linear operators. These ideas were applied by Shimbel [46] to communications
networks. Two authors, Cuninghame-Green [37,38] and Giffler [47], applied them to the
problem of machine-scheduling. Others [48,49,50,51] have discussed their usefulness in
applications to shortest path problems in graphs. Cuninghame-Green gives several examples
throughout his book [39], primarily in the field of operations research. Another useful
application, to image algebra, was again independently developed by G. X. Ritter et al. [52].

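In fact, the notion of a matrix product can be generalized to what is called the
generalized matrix product [40], whose definition is given below.

Let $\mathbb{F}$ denote a set of numbers. Let $f$ and $g$ be functions from $\mathbb{F} \times \mathbb{F}$ into $\mathbb{F}$. For
simplicity, assume the binary operation $f$ to be associative. Let $\mathbb{F}^{m \times p}$ denote the set of
all $m \times p$ matrices with values in $\mathbb{F}$, and let $(a_{ij}) = A \in \mathbb{F}^{m \times p}$ and $(b_{jk}) = B \in \mathbb{F}^{p \times n}$.
Define $f \cdot g$ to be the function from $\mathbb{F}^{m \times p} \times \mathbb{F}^{p \times n}$ into $\mathbb{F}^{m \times n}$ given by

$$(f \cdot g)(A, B) = C,$$

where $c_{ik} = (a_{i1}\, g\, b_{1k})\, f\, (a_{i2}\, g\, b_{2k})\, f \cdots f\, (a_{ip}\, g\, b_{pk})$ for $i = 1, \ldots, m$, $k = 1, \ldots, n$, and $f$
and $g$ are viewed as binary operations.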
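The generalized matrix product is easy to express with a fold. The Python sketch below, whose function name `generalized_product` is an assumption of this illustration, recovers the ordinary matrix product for $f = +$, $g = *$ and a max-plus product for $f = \vee$, $g = +$.

```python
from functools import reduce
import operator

def generalized_product(f, g, A, B):
    """C[i][k] = (A[i][0] g B[0][k]) f ... f (A[i][p-1] g B[p-1][k]).

    A is m x p and B is p x n, both given as lists of rows; f must be
    associative so that the fold is unambiguous.
    """
    p = len(B)
    assert all(len(row) == p for row in A), "inner dimensions must agree"
    return [
        [reduce(f, (g(A[i][j], B[j][k]) for j in range(p)))
         for k in range(len(B[0]))]
        for i in range(len(A))
    ]

A = [[1, 2], [3, 4]]
B = [[0, 1], [1, 0]]
ordinary = generalized_product(operator.add, operator.mul, A, B)
max_plus = generalized_product(max, operator.add, A, B)
```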

Thus, if $f$ denotes addition and $g$ multiplication, then $(f \cdot g)(A, B)$ is the ordinary matrix
product of matrices $A$ and $B$. Cuninghame-Green develops the setting for a formal matrix
calculus based on the two binary operations $+$ and $\vee$ of the extended real numbers, analogous to
linear algebra which uses the two operations of multiplication and arithmetic sum. He terms
this matrix theory minimax matrix theory. The development of the theory is performed in
the abstract, with an eye towards applications for matrices with values in the set $\mathbb{R}_{\pm\infty}$. The
importance of Cuninghame-Green's work to the image algebra is that not only is the
minimax matrix algebra embedded in the image algebra for the set $\mathbb{R}_{\pm\infty}$, but also for the set
$\mathbb{R}^+_{\pm\infty}$. The set $(\mathbb{R}^+, \vee, \wedge, *)$ is an l-group also. An image algebra transform using either $\mathbin{\boxed{\vee}}$
or $\mathbin{\text{\textcircled{$\vee$}}}$ can thus be viewed as a matrix transform in the minimax algebra for the respective
case of $\mathbb{R}_{\pm\infty}$ or $\mathbb{R}^+_{\pm\infty}$. This completes the mathematical identification of the three main
subalgebras in the image algebra. The linear transforms were classified by Gader [33], who
showed that linear algebra is embedded into image algebra. As a result of each embedding
above, the full power of the respective mathematical theory can be applied to solving
problems in image processing, as long as the image processing problem can be formulated using
image algebra operations of $\oplus$, $\mathbin{\boxed{\vee}}$, or $\mathbin{\text{\textcircled{$\vee$}}}$. Since it has been formally proven that the image
algebra can represent all image-to-image transforms (see section 1.1), the embeddings are
very useful to have.

The rest of this section is devoted to introducing the basic notions of the minimax
algebra structure and properties.




1.2.1. Basic Definitions and Notation

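Let $\mathbb{F}$ be a semi-lattice ordered semi-group with semi-lattice operation $\vee$ and
semi-group operation $\times$. Thus, $\mathbb{F}$ satisfies

$$\begin{aligned}
x \vee (y \vee z) &= (x \vee y) \vee z & (A_1) \\
x \vee y &= y \vee x & (A_2) \\
x \vee x &= x & (A_3)
\end{aligned}$$

as it is a semi-lattice, as well as

$$x \times (y \times z) = (x \times y) \times z \qquad (A_4)$$

as it has an associative semi-group operation $\times$, and

$$\begin{aligned}
x \times (y \vee z) &= (x \times y) \vee (x \times z) & (A_5) \\
(y \vee z) \times x &= (y \times x) \vee (z \times x) & (A_6)
\end{aligned}$$

as it is an ordered semi-group. We call this structure a belt, in the vein of rings. The
operation $\vee$ is called an addition, and the operation $\times$ a multiplication. We shall also call a
semi-lattice an s-lattice.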

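Suppose the belt $\mathbb{F}$ also satisfies the dual to axioms $A_1$ through $A_6$, where $\times'$ is another
binary group multiplication:

$$\begin{aligned}
x \wedge (y \wedge z) &= (x \wedge y) \wedge z & (A'_1) \\
x \wedge y &= y \wedge x & (A'_2) \\
x \wedge x &= x & (A'_3) \\
x \times' (y \times' z) &= (x \times' y) \times' z & (A'_4) \\
x \times' (y \wedge z) &= (x \times' y) \wedge (x \times' z) & (A'_5) \\
(y \wedge z) \times' x &= (y \times' x) \wedge (z \times' x). & (A'_6)
\end{aligned}$$

Here, $\times'$ is called a dual multiplication, and $\wedge$ is called a dual addition. The (group)
multiplication or dual multiplication is not assumed to be commutative.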




If in addition to the above 12 axioms, $\mathbb{F}$ satisfies the following axiom,

$$x \vee (y \wedge x) = x \wedge (y \vee x) = x,$$

then $\mathbb{F}$ is a belt with duality. If the multiplication $\times$ and dual multiplication $\times'$ coincide,
then we call the multiplication self-dual. A belt with duality and self-dual multiplication
corresponds to a lattice-ordered semi-group, or l-semi-group, in lattice theory.

Let $(\mathbb{F}_1, \vee)$ and $(\mathbb{F}_2, \vee)$ be two s-lattices. A function $f: \mathbb{F}_1 \to \mathbb{F}_2$ is an s-lattice
homomorphism if

$$f(x \vee y) = f(x) \vee f(y),$$

for all $x, y \in \mathbb{F}_1$. If $\mathbb{F}_1$ and $\mathbb{F}_2$ are belts and $f: \mathbb{F}_1 \to \mathbb{F}_2$ is an s-lattice homomorphism, then if
$f$ also satisfies

$$f(x \times y) = f(x) \times f(y)$$

for all $x, y \in \mathbb{F}_1$, we say that $f$ is a belt homomorphism. The following is an example of a
belt isomorphism. Define $f: \mathbb{R} \to \mathbb{R}^+$ by

$$f(x) = e^x.$$

Then $f(x \vee y) = f(x) \vee f(y)$, and $f(x + y) = f(x) * f(y)$. It is trivial to show that $f$ is a belt
isomorphism.
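The two homomorphism properties of the exponential map can be checked numerically on a few sample points. The following Python sketch is only a spot check under floating point arithmetic, not a proof; the sample pairs and variable names are arbitrary choices made here.

```python
import math

def f(x):
    """Candidate belt isomorphism from (R, max, +) to (R+, max, *)."""
    return math.exp(x)

pairs = [(-1.5, 0.25), (0.0, 2.0), (3.0, 3.0)]

# f(x v y) = f(x) v f(y): exact, because exp is monotone increasing.
preserves_max = all(f(max(x, y)) == max(f(x), f(y)) for x, y in pairs)

# f(x + y) = f(x) * f(y): holds up to floating point rounding.
preserves_mult = all(math.isclose(f(x + y), f(x) * f(y)) for x, y in pairs)

# The inverse map is the natural logarithm.
round_trip = all(math.isclose(math.log(f(x)), x, abs_tol=1e-12) for x, _ in pairs)
```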

The belts $\mathbb{R}$ and $\mathbb{R}^+$ are commutative belts, that is, the multiplication $\times$ commutes.
Each also has an identity element under the multiplication, namely 0 for $\mathbb{R}$ and 1 for $\mathbb{R}^+$.
Because they are groups, each element $r \in \mathbb{F}$ has a unique multiplicative inverse; we call
such a belt a division belt, by analogy with division rings. A belt has a null element if there
exists an element $\theta \in \mathbb{F}$ such that

$$\forall x \in \mathbb{F},\quad x \vee \theta = x \quad \text{and} \quad x \times \theta = \theta \times x = \theta.$$

The belts $(\mathbb{R}_{-\infty}, \vee, +)$ and $(\mathbb{R}_{\pm\infty}, \vee, +)$ each have the element $-\infty$ as its null element.



A division belt with distinct operations $\times$ and $\vee$ and with duality corresponds to a
lattice-ordered group, or l-group. In fact, if $(\mathbb{F}, \vee, \times)$ is a division belt with distinct operations $\vee$
and $\times$, then by defining

$$x \wedge y = (x^{-1} \vee y^{-1})^{-1}, \quad \forall\, x, y \in \mathbb{F} \qquad (1\text{-}2)$$

we have introduced a second (dual) s-lattice operation $\wedge$ such that $(\mathbb{F}, \vee, \wedge)$ becomes a
(distributive) lattice [36]. In our terms, the division belt $\mathbb{F}$ acquires a duality with a self-dual
multiplication. Our main interest will be for the l-groups $(\mathbb{F}, \vee, \times, \wedge, \times') = (\mathbb{R}, \vee, +, \wedge, +)$
and $(\mathbb{R}^+, \vee, *, \wedge, *)$, $*$ representing real multiplication. From the above discussion, it
follows that $(\mathbb{R}, \vee, +, \wedge, +)$ and $(\mathbb{R}^+, \vee, *, \wedge, *)$ are isomorphic as l-groups.

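An arbitrary l-group $\mathbb{F}$ having two distinct binary operations $\vee$ and $\times$ can be extended
in the following way. We adjoin the elements $+\infty$ and $-\infty$ to the set $\mathbb{F}$ and denote this
new set by $\mathbb{F}_{\pm\infty}$, where $-\infty < x < +\infty$ for all $x \in \mathbb{F}$. We define a multiplication and a dual
multiplication in $\mathbb{F}_{\pm\infty}$ by: if $x, y \in \mathbb{F}$, then $x \times y$ is already defined. Otherwise,

$$\begin{aligned}
x \times (-\infty) &= (-\infty) \times x = -\infty, & x &\in \mathbb{F} \cup \{-\infty\} \\
x \times (+\infty) &= (+\infty) \times x = +\infty, & x &\in \mathbb{F} \cup \{+\infty\} \\
x \times' (-\infty) &= (-\infty) \times' x = -\infty, & x &\in \mathbb{F} \cup \{-\infty\} \\
x \times' (+\infty) &= (+\infty) \times' x = +\infty, & x &\in \mathbb{F} \cup \{+\infty\} \\
(-\infty) \times (+\infty) &= (+\infty) \times (-\infty) = -\infty \\
(-\infty) \times' (+\infty) &= (+\infty) \times' (-\infty) = +\infty.
\end{aligned}$$

The element $-\infty$ acts as a null element in the entire system $(\mathbb{F}_{\pm\infty}, \vee, \times)$ and the element
$+\infty$ acts as a null element in the entire system $(\mathbb{F}_{\pm\infty}, \wedge, \times')$. However, the multiplications
$\times$ and $\times'$ are asymmetric between the elements $-\infty$ and $+\infty$. The elements in $\mathbb{F}$ are called
the finite elements.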




We call such a system $(\mathbb{F}_{\pm\infty}, \vee, \times, \wedge, \times')$ a bounded l-group, and $\mathbb{F}$ is called the group of
the bounded l-group $\mathbb{F}_{\pm\infty}$.

The two bounded l-groups $(\mathbb{R}_{\pm\infty}, \vee, +, \wedge, +')$ and $(\mathbb{R}^+_{\pm\infty}, \vee, *, \wedge, *')$ will be our main
concern. Another bounded l-group of interest is the 3-element bounded l-group with group
denoted by $\mathbb{F}_3$. Note that the boolean algebra $(\{-\infty, \phi\}, \vee, \wedge)$ is embedded in $\mathbb{F}_3$, with OR
$= \vee$ (maximum), AND $= \wedge$ (minimum), FALSE $= -\infty$, and TRUE $= \phi$. It is simple to
check that the familiar truth tables hold.
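The truth tables can be verified mechanically. In the sketch below the identity element is modelled by the real number 0.0, which is the identity of $(\mathbb{R}, +)$; this concrete choice, like the names `FALSE`, `TRUE` and `to_bool`, is an assumption made for illustration.

```python
from itertools import product

FALSE = float("-inf")   # bottom element
TRUE = 0.0              # assumed concrete stand-in for the identity element

def to_bool(v):
    """Translate an element of {-inf, identity} to a Python bool."""
    return v == TRUE

values = (FALSE, TRUE)

# OR is the lattice maximum, AND is the lattice minimum.
or_ok = all(
    to_bool(max(p, q)) == (to_bool(p) or to_bool(q))
    for p, q in product(values, repeat=2)
)
and_ok = all(
    to_bool(min(p, q)) == (to_bool(p) and to_bool(q))
    for p, q in product(values, repeat=2)
)
```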

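Let $(\mathbb{F}, \vee, \times)$ be a belt, and let $(T, \vee)$ be an s-lattice. Suppose we have a right
multiplication of elements of $T$ by elements of $\mathbb{F}$:

$$x \times \lambda \in T \quad \forall \text{ pairs } x, \lambda,\ x \in T,\ \lambda \in \mathbb{F}.$$

We call $(T, \vee)$ a right s-lattice space over $(\mathbb{F}, \vee, \times)$, or just say $T$ is a space over $\mathbb{F}$, if the
following axioms are satisfied for all $x, y \in T$ and for all $\lambda, \mu \in \mathbb{F}$:

$$\begin{aligned}
&(T, \vee) \text{ is an s-lattice} \\
&(x \times \lambda) \times \mu = x \times (\lambda \times \mu) \\
&(x \vee y) \times \lambda = (x \times \lambda) \vee (y \times \lambda) \\
&x \times (\lambda \vee \mu) = (x \times \lambda) \vee (x \times \mu)
\end{aligned}$$

and if $\mathbb{F}$ has an identity element $\phi$,

$$x \times \phi = x.$$

Such spaces play the role of vector spaces in the minimax theory. If $T$ and $\mathbb{F}$ are
known, then we shall simply say that $T$ is a space.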

A  subspace  is  a  subset  of  a  space  which  is  itself  a  space  over  the  belt  F. 

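Let $(S, \vee)$, $(T, \vee)$ be given spaces over a belt $(\mathbb{F}, \vee, \times)$. An s-lattice homomorphism
$g: (S, \vee) \to (T, \vee)$ is called right linear (over $\mathbb{F}$) if

$$g(x \times \lambda) = g(x) \times \lambda \quad \forall x \in S,\ \forall \lambda \in \mathbb{F}.$$

We denote the set of all right-linear homomorphisms from $S$ to $T$ over $\mathbb{F}$ by
$\mathrm{Hom}_{\mathbb{F}}(S, T)$. That is,

$$\mathrm{Hom}_{\mathbb{F}}(S, T) = \{ g : S \to T \text{ is a homomorphism and } g(x \times \lambda) = g(x) \times \lambda \ \ \forall x \in S,\ \forall \lambda \in \mathbb{F} \}.$$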

Let $(\mathbb{F}, \vee, \times)$ be a belt and $(T, \vee)$ be an s-lattice, and suppose we have defined a left
multiplication of elements of $T$ by elements of $\mathbb{F}$:

$$\lambda \times x \in T \quad \forall \text{ pairs } x, \lambda,\ x \in T,\ \lambda \in \mathbb{F}.$$

The left variants of the above five axioms are easily stated. We call a system satisfying
those left axioms a left space over $\mathbb{F}$. This allows us to define a two-sided space. A
two-sided space is a triple $(L, T, R)$ such that

$L$ is a belt and $T$ is a left space over $L$.

$R$ is a belt and $T$ is a right space over $R$.

$\forall \lambda \in L$, $\forall x \in T$ and $\forall \mu \in R$: $\lambda \times (x \times \mu) = (\lambda \times x) \times \mu$.

Let $(\mathbb{F}, \vee, \times)$ be a belt. An important class of spaces over $\mathbb{F}$ is the class of function
spaces. Here, the s-lattice $(T, \vee)$ is $(\mathbb{F}^U, \vee)$. Such spaces are naturally two-sided. We shall
only be interested in the case where $|U| = n \in \mathbb{Z}^+$. A space $(T, \vee)$ is then of the form $(\mathbb{F}^n, \vee)$, and
hence our spaces $\mathbb{F}^n$ are spaces of n-tuples.

When discussing conjugacy in linear operator theory, two approaches are commonly used. One defines the conjugate of a given space $S$ as a special set $S^*$ of linear, scalar-valued functions defined on $S$. The other involves defining an involution taking $x \in S$ to $x^* \in S^*$ which satisfies certain axioms. (Recall a function $f$ is an involution if $f(f(x)) = x$.) The situation is slightly more complicated in the case of lattice transforms.

Let $(S, \vee, \times)$ and $(T, \wedge, \times')$ be given belts. We say that $(T, \wedge, \times')$ is conjugate to $(S, \vee, \times)$ if there is a function $g: S \to T$ such that

$g$ is bijective  (C1)

$\forall x, y \in S$, $g(x \vee y) = g(x) \wedge g(y)$  (C2)

$\forall x, y \in S$, $g(x \times y) = g(y) \times' g(x)$.  (C3)

In lattice theory, $g$ is called a dual isomorphism. Note that conjugacy is a symmetric relation. If $(S, \vee, \wedge)$ is an s-lattice with duality satisfying the first two axioms, then we say that $S$ is self-conjugate. If $(S, \vee, \times, \wedge, \times')$ is a belt with duality, we say that $(S, \vee, \times, \wedge, \times')$ is self-conjugate if $(S, \wedge, \times')$ is conjugate to $(S, \vee, \times)$.

In particular, every division belt is self-conjugate under the bijection $x^* = x^{-1}$, and every bounded l-group is self-conjugate under the bijection $(-\infty)^* = +\infty$, $(+\infty)^* = -\infty$, and $x^* = x^{-1}$ if $x$ is finite.

1 o ?. Matrix Algebra

We now present the extension of the belt operations to matrices. Let $(F, \vee, \times)$ be a belt. Let $\mathcal{M}_{mn}$ be the set of all $m \times n$ matrices with values in the set $F$, and let $s = (s_{ij})$, $t = (t_{ij}) \in \mathcal{M}_{mn}$. Then we define

\[ (s_{ij}) \vee (t_{ij}) = (s_{ij} \vee t_{ij}) \]

and for $(s_{ij}) \in \mathcal{M}_{mh}$, $(t_{jk}) \in \mathcal{M}_{hn}$ we have

\[ (s_{ij}) \times (t_{jk}) = \Bigl( \bigvee_{j=1}^{h} (s_{ij} \times t_{jk}) \Bigr) \in \mathcal{M}_{mn}. \]

Suppose $s \in \mathcal{M}_{mn}$ and $t \in \mathcal{M}_{hq}$. We say that $s$ and $t$ are conformable for addition whenever both $m = h$ and $n = q$, and conformable for multiplication whenever $n = h$. For the remainder of this presentation, we use the notation $F^n$ and $\mathcal{M}_{mn}$ as defined above. Also, we call an n-tuple or a matrix finite if all its elements are finite, i.e. not equal to either $+\infty$ or $-\infty$.
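
As a minimal sketch of the two matrix operations just defined, the Python below takes the belt to be the real numbers with $-\infty$ adjoined, with $\vee$ read as maximum and $\times$ read as ordinary addition. The names `mat_max` and `mat_mul` are illustrative and do not come from the source, and the sketch assumes that no entry equals $+\infty$ (in Python `inf + -inf` is `nan`, which is not the belt's convention).

```python
NEG_INF = float("-inf")  # the null element of the belt in this sketch


def mat_max(s, t):
    """Entrywise maximum: (s_ij) v (t_ij) = (s_ij v t_ij)."""
    if len(s) != len(t) or len(s[0]) != len(t[0]):
        raise ValueError("not conformable for addition")
    return [[max(x, y) for x, y in zip(row_s, row_t)]
            for row_s, row_t in zip(s, t)]


def mat_mul(s, t):
    """Max-plus product: entry (i, k) is the maximum over j of s_ij + t_jk."""
    if len(s[0]) != len(t):
        raise ValueError("not conformable for multiplication")
    return [[max(s[i][j] + t[j][k] for j in range(len(t)))
             for k in range(len(t[0]))]
            for i in range(len(s))]


# Hypothetical 2 x 2 matrices, chosen only for illustration.
s = [[1, NEG_INF], [0, 2]]
t = [[3, 0], [NEG_INF, 1]]
print(mat_max(s, t))
print(mat_mul(s, t))
```

Plain nested lists are used so that the index pattern of the definition stays visible; an array library would be a natural choice for larger matrices.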




If $(F, \vee, \times, \wedge, \times')$ is a belt with duality, then we say that a space $(T, \vee)$ over $F$ has a duality if

a dual addition $\wedge$ is defined where $(T, \vee, \wedge)$ is an s-lattice with duality;

$(T, \wedge)$ is a space over the belt $(F, \wedge, \times')$.

We also have a dual matrix addition and dual multiplication defined for matrices over a belt with duality:

\[ (s_{ij}) \wedge (t_{ij}) = (s_{ij} \wedge t_{ij}) \]

and for $(s_{ij}) \in \mathcal{M}_{mh}$, $(t_{jk}) \in \mathcal{M}_{hn}$, we have

\[ (s_{ij}) \times' (t_{jk}) = \Bigl( \bigwedge_{j=1}^{h} (s_{ij} \times' t_{jk}) \Bigr) \]

with the expressions conformable for dual addition $\wedge$ and conformable for dual multiplication $\times'$ used in the obvious way.

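Let $(F, \vee, \times)$ be a belt and let $\mathcal{M}_{pq}$ denote the set of $p \times q$ matrices with values in $F$. The following are some basic properties that are proven in [39].

(1) $(\mathcal{M}_{mn}, \vee)$ is an s-lattice and is a function space over $(F, \vee, \times)$;

(2) $(\mathcal{M}_{nn}, \vee, \times)$ is a belt;

(3) $(\mathcal{M}_{np}, \vee)$ is a left space over the belt $(\mathcal{M}_{nn}, \vee, \times)$;

(4) $\mathcal{M}_{np}$ is a right space over the belt $F$;

(5) Scalar multiplication of a matrix $s$ by an element $\lambda \in F$ is defined by

\[ (s_{ij}) \times \lambda = (s_{ij} \times \lambda) \]

\[ \lambda \times (s_{ij}) = (\lambda \times s_{ij}) \]

for all $(s_{ij}) \in \mathcal{M}_{np}$, $\lambda \in F$;

(6) For all $s \in \mathcal{M}_{nn}$, $t, u \in \mathcal{M}_{np}$, $\lambda \in F$,

\[ s \times (t \vee u) = (s \times t) \vee (s \times u) \]
\[ s \times (t \times \lambda) = (s \times t) \times \lambda. \]

Since the s-lattice $\mathcal{M}_{n1}$ is isomorphic to the s-lattice $F^n$, $F^n$ is a function space over $F$ as well as a space over $\mathcal{M}_{nn}$. This mimics the classical role of matrices as linear transformations of spaces of n-tuples.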

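Two important matrices in our present setting are the identity matrix and the null matrix. Suppose the belt $F$ has identity and null elements $\phi$ and $-\infty$ respectively. We define the identity matrix $e \in \mathcal{M}_{nn}$ by

\[ e = \begin{pmatrix} \phi & & -\infty \\ & \ddots & \\ -\infty & & \phi \end{pmatrix} \]

and the null matrix $\Phi \in \mathcal{M}_{mn}$ by

\[ \Phi = \begin{pmatrix} -\infty & \cdots & -\infty \\ \vdots & & \vdots \\ -\infty & \cdots & -\infty \end{pmatrix}. \]

Thus we have, $\forall s \in \mathcal{M}_{nn}$ and for $\Phi \in \mathcal{M}_{nn}$,

\[ e \times s = s \times e = s \]

\[ s \vee \Phi = s \]

\[ s \times \Phi = \Phi \times s = \Phi. \]

In the bounded l-group $\mathbf{R}_{\pm\infty}$ we have

\[ e = \begin{pmatrix} 0 & & -\infty \\ & \ddots & \\ -\infty & & 0 \end{pmatrix} \]

and in $\mathbf{R}^{+}_{\pm\infty}$ we have

\[ e = \begin{pmatrix} 1 & & -\infty \\ & \ddots & \\ -\infty & & 1 \end{pmatrix}. \]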
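
The identities for $e$ and $\Phi$ can be checked numerically. The sketch below assumes the additive case, in which $\phi = 0$, $\vee$ is maximum and $\times$ is addition; the helper names `identity` and `null` and the test matrix `s` are hypothetical and chosen only for illustration.

```python
NEG_INF = float("-inf")


def mat_max(s, t):
    """Entrywise maximum of two matrices of the same shape."""
    return [[max(x, y) for x, y in zip(rs, rt)] for rs, rt in zip(s, t)]


def mat_mul(s, t):
    """Max-plus product: entry (i, k) is the maximum over j of s_ij + t_jk."""
    return [[max(s[i][j] + t[j][k] for j in range(len(t)))
             for k in range(len(t[0]))]
            for i in range(len(s))]


def identity(n, phi=0):
    """Identity matrix e: phi on the diagonal, -inf elsewhere."""
    return [[phi if i == j else NEG_INF for j in range(n)] for i in range(n)]


def null(m, n):
    """Null matrix: every entry is -inf."""
    return [[NEG_INF] * n for _ in range(m)]


s = [[1, 5], [NEG_INF, 2]]  # hypothetical test matrix
e, z = identity(2), null(2, 2)

checks = {
    "e x s = s": mat_mul(e, s) == s,
    "s x e = s": mat_mul(s, e) == s,
    "s v null = s": mat_max(s, z) == s,
    "s x null = null": mat_mul(s, z) == z,
    "null x s = null": mat_mul(z, s) == z,
}
print(checks)
```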


Conjugacy extends to matrices if the underlying value set is itself a self-conjugate belt. This is stated in the next proposition.

Proposition 1.1 [39]. If $(F, \vee, \times, \wedge, \times')$ is a self-conjugate belt, then $(\mathcal{M}_{nn}, \vee, \times, \wedge, \times')$ is a self-conjugate belt.


In linear algebra, we characterize linear transformations of vector spaces entirely in terms of matrices. Are we able to do a similar classification here? The following results give necessary and sufficient conditions for this to be the case.

Theorem 1.2 [39]. Let $F$ be a belt which has an identity element $\phi$ with respect to $\times$ and a null element $\theta$ with respect to $\vee$. Then for all integers $m, n \geq 1$, $\mathcal{M}_{mn}$ is isomorphic to $\mathrm{Hom}_F(F^n, F^m)$.

Corollary 1.3 [39]. Let $F$ be a belt, and let $n \geq 1$ be a given integer. Then a necessary and sufficient condition that $\mathcal{M}_{mn}$ be isomorphic to $\mathrm{Hom}_F(F^n, F^m)$ for all integers $m \geq 1$ is that $F$ have an identity element $\phi$ with respect to $\times$ and a null element $\theta$ with respect to $\vee$.

We call a matrix $s \in \mathcal{M}_{mn}$ a lattice transform.


Many of the results that were stated in Cuninghame-Green's book can be viewed in the context of a dual lattice-ordered semi-group, which has been extensively researched [36]. However, we wish to study the structure from a different perspective. The extension of the belt operations to matrices allows us to view matrices as operators on spaces of n-tuples, in a way similar to vector-space transformations. These operators are non-linear due to the lattice structure of the underlying set $F$. Thus, we may study this particular class of non-linear transforms in a mathematically rigorous setting, and, since an image can be viewed as a vector and a template as a matrix (as will be shown in Chapter 2), apply results from the minimax matrix theory directly to solve image processing problems. For example, decomposition of matrices corresponds to decomposition of templates. This particular application is discussed in Chapter 5.




CHAPTER  2 
THE  ISOMORPHISM 

In his Ph.D. dissertation, P. Gader showed that linear algebra can be embedded into the image algebra [32]. One very powerful implication of this is that all the tools of linear algebra are directly applicable to solving problems in image processing whenever the image algebra operation $\oplus$ is involved. We now show an embedding of the minimax algebra into image algebra for the two cases where the belts are $\mathbf{R}_{\pm\infty}$ and $\mathbf{R}^{+}_{\pm\infty}$. We employ the same functions $\Psi$ and $\nu$ as used by Gader in his dissertation.

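Let $X$ and $Y$ be finite arrays, with $|X| = m$ and $|Y| = n$. Assume the points of $X$ are labelled lexicographically $x_1, x_2, \ldots, x_m$. Assume a similar labelling for $Y$: $Y = \{y_1, y_2, \ldots, y_n\}$. Let $\mathbf{R}^m$ have its usual meaning. Let $\mathbf{R}_{\pm\infty}^{m} = \{(x_1, \ldots, x_m) : x_i \in \mathbf{R}_{\pm\infty}\}$. That is, $\mathbf{R}_{\pm\infty}^{m}$ is the set of row vectors of m-tuples with values in $\mathbf{R}_{\pm\infty}$. Let $\mathcal{M}_{mn}$ denote the set of $m \times n$ matrices with values in $\mathbf{R}_{\pm\infty}$ and define $\nu : \mathbf{R}_{\pm\infty}^{X} \to \mathbf{R}_{\pm\infty}^{m}$ by

\[ \nu(a) = (a(x_1), \ldots, a(x_m)). \]

Define $\Psi : (\mathbf{R}_{\pm\infty}^{X})^{Y} \to \mathcal{M}_{mn}$ by

\[ \Psi(t) = M_t = (p_{ij}), \quad \text{where } p_{ij} = t_{y_j}(x_i). \]

Note that the j-th column of $M_t$ is simply $(\nu(t_{y_j}))'$, the prime denoting transpose.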

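In the following lemmas, we assume that $|X| = m$, $|Y| = n$, and $|W| = l$. We claim the following:

Lemma 2.1. $\nu(a \boxed{\vee} t) = \nu(a) \times \Psi(t)$, for $t \in (\mathbf{R}_{\pm\infty}^{X})^{Y}$, $a \in \mathbf{R}_{\pm\infty}^{X}$.

Lemma 2.2. $\nu(a \vee b) = \nu(a) \vee \nu(b)$, $a, b \in F^X$, $F \in \{\mathbf{R}_{\pm\infty}, \mathbf{R}^{+}_{\pm\infty}\}$.

Lemma 2.3. $\Psi(s \boxed{\vee} t) = \Psi(s) \times \Psi(t)$, for $s \in (\mathbf{R}_{\pm\infty}^{X})^{W}$, $t \in (\mathbf{R}_{\pm\infty}^{W})^{Y}$.

Lemma 2.4. $\Psi(s \vee t) = \Psi(s) \vee \Psi(t)$, $s, t \in (F^X)^Y$, $F \in \{\mathbf{R}_{\pm\infty}, \mathbf{R}^{+}_{\pm\infty}\}$.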


The  proofs  are  given  below. 

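Proof of Lemma 2.1.

We must show that

\[ (a \boxed{\vee} t)(y_k) = (\nu(a) \times \Psi(t))_k. \]

First note that $\nu(a \boxed{\vee} t)$ is a $1 \times n$ row vector, as is $\nu(a) \times \Psi(t)$. We have

\[ (a \boxed{\vee} t)(y_k) = \bigvee_{x \in X} \bigl( a(x) + t_{y_k}(x) \bigr) = \bigvee_{i=1}^{m} \bigl( a(x_i) + t_{y_k}(x_i) \bigr). \]

Also,

\[ (\nu(a) \times \Psi(t))_k = \bigvee_{i=1}^{m} \bigl( (\nu(a))_i + (\Psi(t))_{ik} \bigr) = \bigvee_{i=1}^{m} \bigl( a(x_i) + t_{y_k}(x_i) \bigr). \]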

Q.E.D 
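
Lemma 2.1 can be illustrated on a small case. The following is a sketch only: the point labels, the image `a`, the template `t` and the helper names (`nu`, `psi`, `box_max`, `row_times_matrix`) are all hypothetical, images are stored as dictionaries from points to values, and a template is stored as a dictionary sending each $y$ to the image $t_y$ on $X$.

```python
NEG_INF = float("-inf")

X = ["x1", "x2", "x3"]  # hypothetical point labels, |X| = m = 3
Y = ["y1", "y2"]        # |Y| = n = 2


def nu(a, points):
    """Row vector (a(p_1), ..., a(p_k)) of an image a over the given points."""
    return [a[p] for p in points]


def psi(t):
    """m x n matrix with entries p_ij = t_{y_j}(x_i)."""
    return [[t[y][x] for y in Y] for x in X]


def box_max(a, t):
    """Additive maximum: value at y is the max over x of a(x) + t_y(x)."""
    return {y: max(a[x] + t[y][x] for x in X) for y in Y}


def row_times_matrix(v, M):
    """Max-plus product of a row vector with a matrix."""
    return [max(v[i] + M[i][k] for i in range(len(v)))
            for k in range(len(M[0]))]


a = {"x1": 0, "x2": 2, "x3": -1}
t = {"y1": {"x1": 1, "x2": NEG_INF, "x3": 4},
     "y2": {"x1": NEG_INF, "x2": 0, "x3": 0}}

lhs = nu(box_max(a, t), Y)                  # image side of Lemma 2.1
rhs = row_times_matrix(nu(a, X), psi(t))    # matrix side of Lemma 2.1
print(lhs, rhs)
```

Both sides evaluate to the same row vector, which is what the lemma asserts for this particular choice of `a` and `t`.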


Proof of Lemma 2.2.

At location $x_k$, the image $a \vee b$ has value $a(x_k) \vee b(x_k)$. At location $k$, the row vector $\nu(a) \vee \nu(b)$ has value $(\nu(a))_k \vee (\nu(b))_k = a(x_k) \vee b(x_k)$.

Q.E.D 


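Proof of Lemma 2.3.

Here, $s \in (\mathbf{R}_{\pm\infty}^{X})^{W}$ and $t \in (\mathbf{R}_{\pm\infty}^{W})^{Y}$ implies

\[ s \boxed{\vee} t = r \in (\mathbf{R}_{\pm\infty}^{X})^{Y}, \]

and

\[ r_{y_j}(x_i) = \bigvee_{k=1}^{l} \bigl( t_{y_j}(w_k) + s_{w_k}(x_i) \bigr). \]

Now, let $\Psi(s) \times \Psi(t) = u \in \mathcal{M}_{mn}$. We have

\[ u_{ij} = \bigvee_{k=1}^{l} \bigl( (\Psi(s))_{ik} + (\Psi(t))_{kj} \bigr) = \bigvee_{k=1}^{l} \bigl( s_{w_k}(x_i) + t_{y_j}(w_k) \bigr) = \bigvee_{k=1}^{l} \bigl( t_{y_j}(w_k) + s_{w_k}(x_i) \bigr). \]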


Q.E.D. 


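Proof of Lemma 2.4.

Here, $s, t \in (\mathbf{R}_{\pm\infty}^{X})^{Y}$. Then

\[ (s \vee t)_{y_j}(x_i) = s_{y_j}(x_i) \vee t_{y_j}(x_i), \]

\[ (\Psi(s) \vee \Psi(t))_{ij} = (\Psi(s))_{ij} \vee (\Psi(t))_{ij} = s_{y_j}(x_i) \vee t_{y_j}(x_i). \]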


Q.E.D. 


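In order to prove the isomorphism theorem, we will use the following lemma.

Lemma 2.5. $\Psi(t^*) = (\Psi(t))^*$, $t \in (F^X)^Y$, where $F$ denotes either $\mathbf{R}_{\pm\infty}$ or $\mathbf{R}^{+}_{\pm\infty}$. In this particular instance we let $t^*$ denote the conjugate template of $t \in (F^X)^Y$.

Proof: Let $s = t^*$. Then $s \in (F^Y)^X$, and

\[ \Psi(t^*) = \Psi(s) = M_s = (p_{ij}), \quad \text{where } p_{ij} = s_{x_j}(y_i) = (t^*)_{x_j}(y_i) = [t_{y_i}(x_j)]^*. \]

Obviously,

\[ \Psi(t) = M_t = (q_{ij}), \quad \text{where } q_{ij} = t_{y_j}(x_i). \]

Thus,

\[ p_{ij} = [t_{y_i}(x_j)]^* = (q_{ji})^*, \]

\[ M_s = (p_{ij}) = ([q_{ji}]^*) = (q_{ij})^* = (M_t)^*, \]

and we have $\Psi(t^*) = M_s = (M_t)^* = (\Psi(t))^*$.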


Q.E.D. 
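
A small numerical illustration of Lemma 2.5 follows. It is a sketch under stated assumptions: the value set is taken to be the additive bounded l-group, so the scalar conjugate is negation (with $(-\infty)^* = +\infty$), the conjugate of a matrix is read as its transpose with every entry conjugated, and all names and data (`psi`, `conj_template`, `conj_matrix`, `t`) are hypothetical.

```python
NEG_INF = float("-inf")

X = ["x1", "x2"]        # hypothetical point labels
Y = ["y1", "y2", "y3"]


def psi(t, domain, target):
    """Matrix with entry (i, j) = t_{target_j}(domain_i)."""
    return [[t[y][x] for y in target] for x in domain]


def conj_template(t):
    """Conjugate template from X to Y: (t*)_x(y) = (t_y(x))* = -t_y(x)."""
    return {x: {y: -t[y][x] for y in Y} for x in X}


def conj_matrix(M):
    """Conjugate matrix: transpose, then conjugate (negate) each entry."""
    return [[-M[i][j] for i in range(len(M))] for j in range(len(M[0]))]


t = {"y1": {"x1": 1, "x2": NEG_INF},
     "y2": {"x1": 0, "x2": 3},
     "y3": {"x1": -2, "x2": 5}}

lhs = psi(conj_template(t), Y, X)   # Psi(t*)
rhs = conj_matrix(psi(t, X, Y))     # (Psi(t))*
print(lhs == rhs)
```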


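The following theorem, along with Lemmas 2.1 through 2.4, shows how the embedding of the minimax algebra into the image algebra is accomplished.

Theorem 2.6. For a finite array $X$, $|X| = m$,

\[ \{ \mathbf{R}_{\pm\infty}^{X}, \vee, \wedge;\ (\mathbf{R}_{\pm\infty}^{X})^{X}, \boxed{\vee}, \vee, \boxed{\wedge}, \wedge;\ \boxed{\vee}, \boxed{\wedge} \} \quad \text{is isomorphic to} \quad \{ \mathbf{R}_{\pm\infty}^{m}, \vee, \wedge;\ \mathcal{M}_{mm}, \times, \vee, \times', \wedge;\ \times, \times' \}, \]

where $\mathcal{M}_{mm}$ is the set of all $m \times m$ matrices with entries in the bounded l-group $\mathbf{R}_{\pm\infty}$.

Proof: By Lemma 2.1, $\nu$ preserves image-template multiplication, and by Lemma 2.2, $\nu$ preserves the image-image pointwise maximum operation. By Lemmas 2.3 and 2.4, for $X = Y = W$, $\Psi$ preserves the operations of $\boxed{\vee}$ and $\vee$ between templates. Let $1 \in (\mathbf{R}_{\pm\infty}^{X})^{X}$ denote the identity template defined by

\[ 1_y(x) = \begin{cases} 0 & \text{if } y = x \\ -\infty & \text{otherwise.} \end{cases} \]

It is trivial to show that $\Psi(1) = e \in \mathcal{M}_{mm}$, the identity matrix in $\mathcal{M}_{mm}$.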


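We now show that the operations of $\boxed{\wedge}$ and $\wedge$ are also preserved under $\Psi$. It is not difficult to show that $\Psi(s \wedge t) = \Psi(s) \wedge \Psi(t)$. Let $r = s \wedge t$. Then $\Psi(r) = M_r = (m_{ij}) = (r_{y_j}(x_i))$, where $r_{y_j}(x_i) = s_{y_j}(x_i) \wedge t_{y_j}(x_i)$. Thus,

\[ \Psi(s) \wedge \Psi(t) = M_s \wedge M_t = (s_{y_j}(x_i)) \wedge (t_{y_j}(x_i)) = (s_{y_j}(x_i) \wedge t_{y_j}(x_i)) = (r_{y_j}(x_i)). \]

By definition, $s \boxed{\wedge} t = (t^* \boxed{\vee} s^*)^*$, and, using Lemma 2.5 with $F = \mathbf{R}$, Lemma 2.3, and property C3, we have

\[ \Psi(s \boxed{\wedge} t) = \Psi((t^* \boxed{\vee} s^*)^*) = [\Psi(t^* \boxed{\vee} s^*)]^* = (\Psi(t^*) \times \Psi(s^*))^* \]

\[ = (\Psi(s^*))^* \times' (\Psi(t^*))^* = \Psi(s) \times' \Psi(t). \]

Thus, $\Psi(s \boxed{\wedge} t) = \Psi(s) \times' \Psi(t)$.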

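It is straightforward to see that $\nu$ is one-to-one and onto $\mathbf{R}_{\pm\infty}^{m}$. To show that $\Psi$ is one-to-one and onto $\mathcal{M}_{mn}$, let $s, t \in (\mathbf{R}_{\pm\infty}^{X})^{Y}$ and suppose that $\Psi(s) = \Psi(t)$. Then $(\Psi(s))_{ij} = (M_s)_{ij} = s_{y_j}(x_i) = t_{y_j}(x_i) = (M_t)_{ij} = (\Psi(t))_{ij}$ and, thus,

\[ s_{y_j}(x_i) = t_{y_j}(x_i) \quad \text{for all } j = 1, \ldots, n \text{ and for all } i = 1, \ldots, m. \]

So $\Psi$ is one-to-one, as $s = t$. Let $M = (m_{ij}) \in \mathcal{M}_{mn}$. Define $t \in (\mathbf{R}_{\pm\infty}^{X})^{Y}$ by $t_{y_j}(x_i) = m_{ij}$. Then $\Psi(t) = M$. Setting $m = n$, we see that $\Psi$ is one-to-one and onto $\mathcal{M}_{mm}$.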

Q.E.D. 
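
The step of the proof that rewrites the dual product through conjugation can be sketched numerically on the matrix side. The code below assumes finite real entries only, reads $\times$ as the max-plus product, $\times'$ as the min-plus product and the conjugate of a matrix as its negated transpose; the function names and the matrices `A` and `B` are hypothetical.

```python
def max_plus(A, B):
    """Product x: entry (i, k) is the maximum over j of A_ij + B_jk."""
    return [[max(A[i][j] + B[j][k] for j in range(len(B)))
             for k in range(len(B[0]))]
            for i in range(len(A))]


def min_plus(A, B):
    """Dual product x': entry (i, k) is the minimum over j of A_ij + B_jk."""
    return [[min(A[i][j] + B[j][k] for j in range(len(B)))
             for k in range(len(B[0]))]
            for i in range(len(A))]


def conj(M):
    """Conjugate matrix for finite entries: transpose and negate."""
    return [[-M[i][j] for i in range(len(M))] for j in range(len(M[0]))]


A = [[1, 4], [0, -2]]   # hypothetical finite matrices
B = [[3, -1], [2, 5]]

direct = min_plus(A, B)                           # A x' B
via_conjugates = conj(max_plus(conj(B), conj(A)))  # (B* x A*)*
print(direct, via_conjugates)
```

The agreement of the two results is the matrix form of property C3 combined with the fact that conjugation is an involution; entries equal to $\pm\infty$ would need the belt's own conventions for $(-\infty) + (+\infty)$ and are deliberately left out of this sketch.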

Thus, the minimax algebra with the bounded l-group $\mathbf{R}_{\pm\infty}$ is embedded into image algebra by the functions $\Psi^{-1}$ and $\nu^{-1}$. As the bounded l-group $\mathbf{R}_{\pm\infty}$ is isomorphic to the bounded l-group $\mathbf{R}^{+}_{\pm\infty}$, the minimax algebra with the bounded l-group $\mathbf{R}^{+}_{\pm\infty}$ is also embedded into the image algebra. In this case, the matrix operation $\times$ corresponds to the image algebra operation $\ovee$. The isomorphism result is stated in Theorem 2.9.

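Let $X$ and $Y$ be finite arrays as before. Let $\mathbf{R}^m$ have its usual meaning, let $a \in (\mathbf{R}^{+}_{\pm\infty})^{X}$, let $\mathcal{M}_{mn}$ denote the set of $m \times n$ matrices with values in $\mathbf{R}^{+}_{\pm\infty}$, and let $(\mathbf{R}^{+}_{\pm\infty})^{m} = \{(x_1, x_2, \ldots, x_m) : x_i \in \mathbf{R}^{+}_{\pm\infty}\}$. Define $\nu : (\mathbf{R}^{+}_{\pm\infty})^{X} \to (\mathbf{R}^{+}_{\pm\infty})^{m}$ in the usual way by

\[ \nu(a) = (a(x_1), \ldots, a(x_m)). \]

Define $\Psi : ((\mathbf{R}^{+}_{\pm\infty})^{X})^{Y} \to \mathcal{M}_{mn}$ by

\[ \Psi(t) = M_t = (p_{ij}), \quad \text{where } p_{ij} = t_{y_j}(x_i). \]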

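In the following lemmas, we assume that $|X| = m$, $|Y| = n$, and $|W| = l$. We claim the following, for $a, b \in (\mathbf{R}^{+}_{\pm\infty})^{X}$:

Lemma 2.7. $\nu(a \ovee t) = \nu(a) \times \Psi(t)$, for $t \in ((\mathbf{R}^{+}_{\pm\infty})^{X})^{Y}$.

Lemma 2.8. $\Psi(s \ovee t) = \Psi(s) \times \Psi(t)$, for $s \in ((\mathbf{R}^{+}_{\pm\infty})^{X})^{W}$, $t \in ((\mathbf{R}^{+}_{\pm\infty})^{W})^{Y}$.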




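Proof of Lemma 2.7.

We must show that

\[ (a \ovee t)(y_k) = (\nu(a) \times \Psi(t))_k. \]

We have

\[ (a \ovee t)(y_k) = \bigvee_{x \in X} a(x) * t_{y_k}(x) = \bigvee_{i=1}^{m} a(x_i) * t_{y_k}(x_i). \]

Also,

\[ (\nu(a) \times \Psi(t))_k = \bigvee_{i=1}^{m} (\nu(a))_i * (\Psi(t))_{ik} = \bigvee_{i=1}^{m} a(x_i) * t_{y_k}(x_i). \]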

Q.E.D. 

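Proof of Lemma 2.8.

Here, $s \in ((\mathbf{R}^{+}_{\pm\infty})^{X})^{W}$ and $t \in ((\mathbf{R}^{+}_{\pm\infty})^{W})^{Y}$ implies

\[ s \ovee t = r \in ((\mathbf{R}^{+}_{\pm\infty})^{X})^{Y}, \]

and

\[ r_{y_j}(x_i) = \bigvee_{k=1}^{l} t_{y_j}(w_k) * s_{w_k}(x_i). \]

Now, let $\Psi(s) \times \Psi(t) = u \in \mathcal{M}_{mn}$. We have

\[ u_{ij} = \bigvee_{k=1}^{l} (\Psi(s))_{ik} * (\Psi(t))_{kj} = \bigvee_{k=1}^{l} s_{w_k}(x_i) * t_{y_j}(w_k) = \bigvee_{k=1}^{l} t_{y_j}(w_k) * s_{w_k}(x_i). \]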

Q.E.D. 

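Theorem 2.9. For a finite array $X$, with $|X| = m$,

\[ \{ (\mathbf{R}^{+}_{\pm\infty})^{X}, \vee, \wedge;\ ((\mathbf{R}^{+}_{\pm\infty})^{X})^{X}, \ovee, \vee, \owedge, \wedge;\ \ovee, \owedge \} \quad \text{is isomorphic to} \quad \{ (\mathbf{R}^{+}_{\pm\infty})^{m}, \vee, \wedge;\ \mathcal{M}_{mm}, \times, \vee, \times', \wedge;\ \times, \times' \}, \]

where $\mathcal{M}_{mm}$ is the set of all $m \times m$ matrices with entries in the bounded l-group $\mathbf{R}^{+}_{\pm\infty}$.

Proof: By Lemma 2.7, $\nu$ preserves image-template multiplication, and by Lemma 2.2, $\nu$ preserves the image-image pointwise maximum operation. By Lemmas 2.8 and 2.4, for $X = Y = W$, $\Psi$ preserves the operations of $\ovee$ and $\vee$ between templates. Let $1 \in ((\mathbf{R}^{+}_{\pm\infty})^{X})^{X}$ denote the identity template defined by

\[ 1_y(x) = \begin{cases} 1 & \text{if } y = x \\ -\infty & \text{otherwise.} \end{cases} \]

It is trivial to show that $\Psi(1) = e \in \mathcal{M}_{mm}$, the identity matrix in $\mathcal{M}_{mm}$ over $\mathbf{R}^{+}_{\pm\infty}$.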

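In Theorem 2.6, the proof that $\Psi(s \wedge t) = \Psi(s) \wedge \Psi(t)$ was not dependent on the value set $\mathbf{R}_{\pm\infty}$, and hence is true also for templates $s, t \in ((\mathbf{R}^{+}_{\pm\infty})^{X})^{X}$. We now show that the operation of $\owedge$ is also preserved under $\Psi$. By definition, $s \owedge t = \overline{(\bar{t} \ovee \bar{s})}$, and, using Lemma 2.5 with $F = \mathbf{R}^{+}$, Lemma 2.8, and property C3, we have

\[ \Psi(s \owedge t) = \Psi(\overline{(\bar{t} \ovee \bar{s})}) = [\Psi(\bar{t} \ovee \bar{s})]^* = [\Psi(\bar{t}) \times \Psi(\bar{s})]^* \]

\[ = (\Psi(\bar{s}))^* \times' (\Psi(\bar{t}))^* = \Psi(s) \times' \Psi(t). \]

Thus, $\Psi(s \owedge t) = \Psi(s) \times' \Psi(t)$.

We use the fact that Theorem 2.6 showed $\Psi$ and $\nu$ are one-to-one and onto, and also that $\mathbf{R}_{\pm\infty}$ and $\mathbf{R}^{+}_{\pm\infty}$ are isomorphic as bounded l-groups, and we are done.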

Q.E.D. 

We have shown that the minimax algebra with two different interpretations for the bounded l-group with group $F$, namely $F = \mathbf{R}$ and $F = \mathbf{R}^{+}$, is embedded in the image algebra. Using the notation $\mathbf{R}^{+}_{\pm\infty}$, rather than a notation that writes 0 explicitly as the bottom element, allows the reader to regard the value sets $\mathbf{R}_{\pm\infty}$ and $\mathbf{R}^{+}_{\pm\infty}$ as basically the same (they are isomorphic as belts), without shifting gears from using 0 in one as the bottom element and $-\infty$ in the other. All minimax properties stated in Cuninghame-Green's book will be valid in the correct context of image algebra notation.


In using the minimax algebra results, we would like to point out that matrix-vector multiplication, that is, multiplication of a matrix by a vector from the right, is used mostly throughout Cuninghame-Green's book. Left multiplication is mentioned at various places, and in fact, most left variants of the right multiplication results will hold. However, for the most part in our applications to image algebra, we will be using the right multiplication form in the development of our theory. The functions $\Psi$ and $\nu$ map the image algebra expression $a \boxed{\vee} t = b$ to the matrix algebra expression $\nu(a) \times \Psi(t) = \nu(b)$, the left multiplication form which we have omitted in our presentation of Cuninghame-Green's material. The diagram in Figure 5 explains how we will be taking advantage of the minimax algebra results.


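[Diagram: $a \boxed{\vee} t$ is mapped by $\nu$ and $\Psi$ to $\nu(a) \times \Psi(t)$, which is mapped by the transpose to $(\Psi(t))' \times (\nu(a))'$.]

Figure 5. How the Transpose is used in Conjunction with the Isomorphism.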


Let $T$ denote the function that takes a matrix to its transpose as well as the function that takes a template to its transpose. Thus, $T : \mathcal{M}_{mn} \to \mathcal{M}_{nm}$ is defined by

\[ T(a) = a', \]

the prime denoting as usual the transpose of a matrix, and $T : (F^X)^Y \to (F^Y)^X$ is defined by

\[ T(t) = t'. \]

Obviously, $\Psi(T(t)) = T(\Psi(t))$. In a clockwise manner, the functions $\nu$ and $\Psi$ take the product $\nu(a \boxed{\vee} t)$ to $\nu(a) \times \Psi(t)$, which is the matrix $\Psi(t)$ multiplied on the left by the row vector $\nu(a)$. Applying the transpose to $\nu(a) \times \Psi(t)$, we get $T(\nu(a) \times \Psi(t)) = [\Psi(t)]' \times [\nu(a)]'$, which is the matrix $(\Psi(t))' \in \mathcal{M}_{nm}$ multiplied on the right by the column vector $[\nu(a)]'$. We now use our minimax algebra theorems, where matrix-vector multiplication is the matrix multiplied on its right by a column vector. After getting the desired results, we continue on around the diagram clockwise, mapping back by the transpose $T$ again and then by $\nu^{-1}$ or $\Psi^{-1}$. Formally, if $d$ represents the column vector which is the result of applications of minimax algebra theorems to the initial column vector $(\Psi(t))' \times (\nu(a))'$, then $\nu^{-1}(T(d))$ will be an image. A similar situation holds for templates.

The minimax algebra results are stated in the usual matrix-vector multiplication order, and the isomorphisms $\Psi$ and $\nu$ are used along with the transpose $T$ to apply the matrix results. When the word isomorphism is used in this context, it will mean the above functions $\Psi$ and $\nu$ explicitly (not with the transpose) unless otherwise stated, with images as row vectors and templates as matrices with images $t_y$ as columns.
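
The role of the transpose can be sketched on the matrix side alone. The example below assumes the additive case (maximum and addition), uses a hypothetical row vector `v` standing in for $\nu(a)$ and a hypothetical matrix `M` standing in for $\Psi(t)$, and checks that the row-vector form and the column-vector form give the same entries.

```python
NEG_INF = float("-inf")


def row_times_matrix(v, M):
    """Row vector times matrix: entry k is the max over i of v_i + M_ik."""
    return [max(v[i] + M[i][k] for i in range(len(v)))
            for k in range(len(M[0]))]


def matrix_times_col(M, c):
    """Matrix times column vector: entry k is the max over i of M_ki + c_i."""
    return [max(M[k][i] + c[i] for i in range(len(c)))
            for k in range(len(M))]


def transpose(M):
    """Ordinary matrix transpose."""
    return [list(row) for row in zip(*M)]


v = [0, 2, -1]                                 # stands in for nu(a)
M = [[1, NEG_INF], [NEG_INF, 0], [4, 0]]       # stands in for Psi(t)

row_form = row_times_matrix(v, M)              # nu(a) x Psi(t)
col_form = matrix_times_col(transpose(M), v)   # (Psi(t))' x (nu(a))'
print(row_form, col_form)
```

A flat list is used for both row and column vectors, so the equality of the two lists is the entrywise statement $T(\nu(a) \times \Psi(t)) = (\Psi(t))' \times (\nu(a))'$.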




PART II

MINIMAX APPLICATIONS TO IMAGE ALGEBRA AND IMAGE PROCESSING

The objective of the chapters in Part II is to show how the minimax algebra can be used to extend basic matrix algebraic results in such a way as to have applications in image processing. The tool that makes the minimax algebra useful in image processing is the isomorphism between the image algebra and the minimax algebra. Before the research presented in this dissertation was conducted, the relationship between the image algebra and the minimax algebra had not been established. The power of the isomorphism is that it makes all results in the minimax algebra applicable to solving image processing problems, just as linear algebra results are applicable to solving image processing problems. For example, template decomposition is presently a very active area of research. The problem of mapping transforms to some types of parallel architectures is equivalent to decomposing a transform $t$ into a product of transforms $t = t^1 \boxed{\vee} t^2 \boxed{\vee} \cdots \boxed{\vee} t^k$, where each factor $t^i$ is directly implementable on the parallel architecture. Since decomposing templates is the same as decomposing matrices, matrix decomposition techniques can be applied to template decomposition problems. Thus far, there exist no decomposition techniques for matrices under the matrix operation $\times$ as presented in section 1.2. Hence, the methods developed in Chapter 5 that decompose matrices are new results. They were developed mainly for solving the problem of mapping of transforms to particular parallel architectures, though they stand by themselves as a new theoretical result in the minimax algebra.

While some other areas of minimax algebra, such as the eigenproblem, may seem to have no current applications to image processing, we present them in their image algebra form because of their interesting mathematical results.




CHAPTER 3

MAPPING OF MINIMAX ALGEBRA PROPERTIES TO IMAGE ALGEBRA PROPERTIES

This chapter is devoted to describing algebraic properties of the substructures $\{ (F^X)^Y, \boxed{\vee}, \vee, \boxed{\wedge}, \wedge \}$, where $F$ is a subbelt of $\mathbf{R}_{\pm\infty}$ or $\mathbf{R}^{+}_{\pm\infty}$. During the investigation of the properties and before the discovery of the link to minimax algebra, many basic properties, such as the associativity of the $\boxed{\vee}$ operation, were proven within the context of the image algebra. Many theorems had excessive notational overhead, and often the proofs were laborious. Most of these same properties were found to have been stated and proven in the context of the minimax algebra [38]. Using the matrix calculus makes some proofs less tedious, and in some cases makes them less cumbersome notationally. Thus, in order to place the presentation in a more elegant mathematical environment, we omit the proofs that were done in the image algebra notation and make use of the isomorphisms given in the previous chapter. Most of the theorems presented here are mapped into image algebra notation using the isomorphisms, and their proofs are omitted. The results will be stated for both bounded l-groups, using the operations $\boxed{\vee}$ and $\ovee$.

Basic Definitions and Properties

Unless otherwise stated, we shall assume that $X$, $Y$, and $W$ are finite coordinate sets, with $|X| = m$, $|Y| = n$, $|W| = k$, with the pixel locations lexicographically ordered as in Chapter 2. The belt $F$ with duality is a subbelt of either $\mathbf{R}_{\pm\infty}$ or $\mathbf{R}^{+}_{\pm\infty}$. The templates $s$ and $t$ will be $F$-valued templates on appropriate domains, and $a$, $b$ will be $F$-valued images. For the appropriate subbelt $F$ of $\mathbf{R}_{\pm\infty}$ or $\mathbf{R}^{+}_{\pm\infty}$ according to the operation $\boxed{\vee}$ or $\ovee$, respectively, we have the following basic properties.




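(1) $((F^X)^Y, \vee)$ is an s-lattice and $((F^X)^Y, \vee)$ is a function space over $(F, \vee, \times)$;

(2) $\{(F^X)^X, \vee, \boxed{\vee}\}$ is a belt; $\{(F^X)^X, \vee, \ovee\}$ is a belt;

(3) $((F^X)^Y, \vee)$ is a left space over the belt $((F^X)^X, \vee, \boxed{\vee})$; $((F^X)^Y, \vee)$ is a left space over the belt $((F^X)^X, \vee, \ovee)$;

(4) $(F^X)^Y$ is a right space over the belt $F$;

(5) We define scalar multiplication of a template $t \in (F^X)^Y$ by a scalar $\lambda \in F$ as multiplication by the one-point template $\lambda \in (F_{-\infty}^{X})^{X}$ or $\lambda \in (F_{-\infty}^{Y})^{Y}$, depending on whether the template $\lambda$ multiplies from the left or from the right, respectively (and adjoining $-\infty$ to $F$ if necessary), as

\[ t \boxed{\vee} \lambda = \lambda \boxed{\vee} t = s \in (F_{-\infty}^{X})^{Y}, \quad \text{where } s_y(x) = t_y(x) + \lambda \]

and

\[ t \ovee \lambda = \lambda \ovee t = s \in (F_{-\infty}^{X})^{Y}, \quad \text{where } s_y(x) = t_y(x) * \lambda. \]

Here,

\[ \lambda_y(x) = \begin{cases} \lambda & \text{if } x = y \\ -\infty & \text{otherwise.} \end{cases} \]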

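Next we state the distributive properties of $\boxed{\vee}$ and $\ovee$ with respect to $\vee$, together with the associative properties.

(6)
\[ \begin{array}{ll}
a \boxed{\vee} (t \vee s) = (a \boxed{\vee} t) \vee (a \boxed{\vee} s) & a \ovee (t \vee s) = (a \ovee t) \vee (a \ovee s) \\
a \boxed{\vee} (t \boxed{\vee} s) = (a \boxed{\vee} t) \boxed{\vee} s & a \ovee (t \ovee s) = (a \ovee t) \ovee s \\
(a \vee b) \boxed{\vee} t = (a \boxed{\vee} t) \vee (b \boxed{\vee} t) & (a \vee b) \ovee t = (a \ovee t) \vee (b \ovee t) \\
(s \vee t) \boxed{\vee} u = (s \boxed{\vee} u) \vee (t \boxed{\vee} u) & (s \vee t) \ovee u = (s \ovee u) \vee (t \ovee u) \\
u \boxed{\vee} (s \vee t) = (u \boxed{\vee} s) \vee (u \boxed{\vee} t) & u \ovee (s \vee t) = (u \ovee s) \vee (u \ovee t) \\
s \boxed{\vee} (t \boxed{\vee} u) = (s \boxed{\vee} t) \boxed{\vee} u & s \ovee (t \ovee u) = (s \ovee t) \ovee u.
\end{array} \]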
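
Two of the identities in property (6) can be spot-checked on the matrix side of the isomorphism. The sketch below assumes the additive case, represents an image by a row vector and a template by a matrix, and uses hypothetical data `a`, `T` and `S`; it is a numerical illustration, not a proof.

```python
NEG_INF = float("-inf")


def vec_times(v, M):
    """Row vector times matrix under max-plus."""
    return [max(v[i] + M[i][k] for i in range(len(v)))
            for k in range(len(M[0]))]


def mat_times(A, B):
    """Matrix product under max-plus."""
    return [[max(A[i][j] + B[j][k] for j in range(len(B)))
             for k in range(len(B[0]))]
            for i in range(len(A))]


def mat_max(A, B):
    """Entrywise maximum of two matrices."""
    return [[max(x, y) for x, y in zip(ra, rb)] for ra, rb in zip(A, B)]


def vec_max(u, v):
    """Entrywise maximum of two vectors."""
    return [max(x, y) for x, y in zip(u, v)]


a = [0, 1]                      # hypothetical image as a row vector
T = [[1, 2], [3, NEG_INF]]      # hypothetical templates as matrices
S = [[0, 5], [NEG_INF, 1]]

# Distributivity: a x (T v S) = (a x T) v (a x S)
distributive = vec_times(a, mat_max(T, S)) == vec_max(vec_times(a, T),
                                                      vec_times(a, S))
# Associativity: a x (T x S) = (a x T) x S
associative = vec_times(a, mat_times(T, S)) == vec_times(vec_times(a, T), S)
print(distributive, associative)
```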

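The duals to properties 1 through 6 also hold, as both the belts $\mathbf{R}_{\pm\infty}$ and $\mathbf{R}^{+}_{\pm\infty}$ have duality.

(7) $((F^X)^Y, \wedge)$ is an s-lattice and $((F^X)^Y, \wedge)$ is a function space over $(F, \wedge, \times')$;

(8) $\{(F^X)^X, \wedge, \boxed{\wedge}\}$ is a belt; $\{(F^X)^X, \wedge, \owedge\}$ is a belt;

etc.

Now let $F$ be a subbelt of $\mathbf{R}$ or $\mathbf{R}^{+}$, and $F_{\pm\infty}$ the bounded l-group with group $F$. Corresponding to the identity matrix and the null matrix we have the identity template $1 \in (F_{\pm\infty}^{X})^{X}$, defined by

\[ 1_y(x) = \begin{cases} \phi & \text{if } x = y \\ -\infty & \text{otherwise} \end{cases} \]

and the null template $\Phi \in (F_{\pm\infty}^{X})^{Y}$ defined by

\[ \Phi_y(x) = -\infty, \quad \text{for all } y \in Y,\ x \in X. \]

For the belt $\mathbf{R}$, $\phi = 0$, and for the belt $\mathbf{R}^{+}$, $\phi = 1$. Thus we have

\[ a \boxed{\vee} 1 = a, \quad t \boxed{\vee} 1 = 1 \boxed{\vee} t = t \quad \forall a \in \mathbf{R}_{\pm\infty}^{X},\ \forall t \in (\mathbf{R}_{\pm\infty}^{X})^{X}. \]

For $\Phi \in (F_{\pm\infty}^{X})^{X}$,

\[ t \vee \Phi = t, \quad t \boxed{\vee} \Phi = \Phi \boxed{\vee} t = \Phi, \quad a \boxed{\vee} \Phi = \text{null image}, \quad \forall a \in \mathbf{R}_{\pm\infty}^{X},\ \forall t \in (\mathbf{R}_{\pm\infty}^{X})^{X}. \]

Similar properties hold for the operation $\ovee$.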

Homomorphisms

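We now discuss homomorphisms in the context of the image algebra. Let $|X| = m$. Since the s-lattice $\{F^X, \vee\}$ is isomorphic (via $\nu$) to the s-lattice $\{F^m, \vee\}$, $\{F^X, \vee\}$ is a space. For $\lambda \in F^X$ the constant image, we have

\[ a \vee \lambda = \lambda \vee a = b \in F^X, \quad \text{where } b(x) = a(x) \vee \lambda, \]

and for the one-point template $\lambda \in (\mathbf{R}_{\pm\infty}^{X})^{X}$,

\[ a \boxed{\vee} \lambda = \lambda \boxed{\vee} a = b \in F^X, \quad \text{where } b(x) = a(x) + \lambda, \]

if $F = \mathbf{R}$, and

\[ a \ovee \lambda = \lambda \ovee a = b \in F^X, \quad \text{where } b(x) = a(x) * \lambda, \]

if $F = \mathbf{R}^{+}$. Let $F \in \{\mathbf{R}_{\pm\infty}, \mathbf{R}^{+}_{\pm\infty}\}$. Since $\{F^X, \vee\}$ is an s-lattice, an s-lattice homomorphism from $F^X$ to $F^Y$ is a function $f : F^X \to F^Y$ satisfying

\[ f(a \vee b) = f(a) \vee f(b). \]

A right linear homomorphism $g : F^X \to F^Y$ is an s-lattice homomorphism satisfying

\[ g(a \boxed{\vee} \lambda) = g(a) \boxed{\vee} \lambda. \]

Thus, the set of all right linear homomorphisms from $F^X$ to $F^Y$ is denoted by

\[ \mathrm{Hom}_F(F^X, F^Y) = \{\, g : F^X \to F^Y, \text{ and } g \text{ satisfies } g(a \vee b) = g(a) \vee g(b),\ g(a \boxed{\vee} \lambda) = g(a) \boxed{\vee} \lambda \,\}, \]

or if $F$ is $\mathbf{R}^{+}$, then

\[ \mathrm{Hom}_F(F^X, F^Y) = \{\, g : F^X \to F^Y, \text{ and } g \text{ satisfies } g(a \vee b) = g(a) \vee g(b),\ g(a \ovee \lambda) = g(a) \ovee \lambda \,\}. \]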
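
To make the two defining conditions concrete, the sketch below checks them for the map $g(a) = a \boxed{\vee} t$ in the additive case, working with row vectors and a matrix in place of images and a template. The matrix `T`, the vectors `a` and `b` and the scalar `lam` are hypothetical, and multiplication by the one-point template is modelled as adding `lam` to every pixel value.

```python
NEG_INF = float("-inf")

T = [[1, NEG_INF], [0, 2], [NEG_INF, -3]]   # hypothetical 3 x 2 template matrix


def g(a):
    """g(a) = a x T under max-plus; maps 3-vectors to 2-vectors."""
    return [max(a[i] + T[i][k] for i in range(len(a)))
            for k in range(len(T[0]))]


def vec_max(u, v):
    """Pointwise maximum of two images given as vectors."""
    return [max(x, y) for x, y in zip(u, v)]


def scale(a, lam):
    """Scalar multiplication by lam: adds lam to every pixel value."""
    return [x + lam for x in a]


a, b, lam = [0, 4, -1], [2, 1, 5], 3

preserves_max = g(vec_max(a, b)) == vec_max(g(a), g(b))
right_linear = g(scale(a, lam)) == scale(g(a), lam)
print(preserves_max, right_linear)
```

The second check is the matrix form of $g(a \boxed{\vee} \lambda) = g(a) \boxed{\vee} \lambda$: adding a constant to every entry of `a` shifts every entry of `g(a)` by the same constant.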

3.1 o Classification of Homomorphisms in the Image Algebra

Right linear transformations can be characterized entirely in terms of template transformations, and we give necessary and sufficient conditions for $(F^X)^Y$ to be isomorphic to $\mathrm{Hom}_F(F^X, F^Y)$.

Theorem 3.1. Let $F$ be a belt with identity and null element. Then for all non-empty finite coordinate sets $X$, $Y$, $(F^X)^Y$ is isomorphic to $\mathrm{Hom}_F(F^X, F^Y)$.

Corollary 3.2. Let $F$ be a belt, and let $X \neq \emptyset$ be a finite coordinate set with $|X| \geq 1$. Then a necessary and sufficient condition that $(F^X)^Y$ be isomorphic to $\mathrm{Hom}_F(F^X, F^Y)$, for all non-empty finite coordinate sets $Y$, is that $F$ have an identity element $\phi$ with respect to $\times$ and a null element $\theta$ with respect to $\vee$.

We call a template $t \in (F^X)^Y$ used with the operation $\vee$, $\boxed{\vee}$, $\boxed{\wedge}$, $\ovee$, or $\owedge$ a lattice transform. We will present an example of a transformation which is not right linear in section 6.1.

Some useful inequalities are stated in the next theorem.


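Theorem 3.3. Let $F$ be a subbelt of $\mathbf{R}_{\pm\infty}$ or $\mathbf{R}^{+}_{\pm\infty}$. Then the following inequalities hold for images and templates with the appropriate domains, having values in $F$.

(i) $a \vee (b \wedge c) \leq (a \vee b) \wedge (a \vee c)$

(ii) $a \wedge (b \vee c) \geq (a \wedge b) \vee (a \wedge c)$

(iii) $(a \wedge b) \boxed{\vee} t \leq (a \boxed{\vee} t) \wedge (b \boxed{\vee} t)$;  $(a \wedge b) \ovee t \leq (a \ovee t) \wedge (b \ovee t)$

(iv) $a \boxed{\vee} (t \wedge s) \leq (a \boxed{\vee} t) \wedge (a \boxed{\vee} s)$;  $a \ovee (t \wedge s) \leq (a \ovee t) \wedge (a \ovee s)$

(v) $(a \vee b) \boxed{\vee} t \geq (a \boxed{\vee} t) \vee (b \boxed{\vee} t)$;  $(a \vee b) \ovee t \geq (a \ovee t) \vee (b \ovee t)$

(vi) $a \boxed{\vee} (t \vee s) \geq (a \boxed{\vee} t) \vee (a \boxed{\vee} s)$;  $a \ovee (t \vee s) \geq (a \ovee t) \vee (a \ovee s)$

(i) $s \vee (t \wedge r) \leq (s \vee t) \wedge (s \vee r)$

(ii) $s \wedge (t \vee r) \geq (s \wedge t) \vee (s \wedge r)$

(iii) $t \boxed{\vee} (s \wedge r) \leq (t \boxed{\vee} s) \wedge (t \boxed{\vee} r)$;  $t \ovee (s \wedge r) \leq (t \ovee s) \wedge (t \ovee r)$

(iv) $(s \wedge r) \boxed{\vee} t \leq (s \boxed{\vee} t) \wedge (r \boxed{\vee} t)$;  $(s \wedge r) \ovee t \leq (s \ovee t) \wedge (r \ovee t)$

(v) $t \boxed{\vee} (s \vee r) \geq (t \boxed{\vee} s) \vee (t \boxed{\vee} r)$;  $t \ovee (s \vee r) \geq (t \ovee s) \vee (t \ovee r)$

(vi) $(s \vee r) \boxed{\vee} t \geq (s \boxed{\vee} t) \vee (r \boxed{\vee} t)$;  $(s \vee r) \ovee t \geq (s \ovee t) \vee (r \ovee t)$

$a \boxed{\vee} (s \boxed{\wedge} r) \leq (a \boxed{\vee} s) \boxed{\wedge} r$ and $a \boxed{\wedge} (s \boxed{\vee} r) \geq (a \boxed{\wedge} s) \boxed{\vee} r$

$t \boxed{\vee} (s \boxed{\wedge} r) \leq (t \boxed{\vee} s) \boxed{\wedge} r$ and $t \boxed{\wedge} (s \boxed{\vee} r) \geq (t \boxed{\wedge} s) \boxed{\vee} r$

and

$a \ovee (s \owedge r) \leq (a \ovee s) \owedge r$ and $a \owedge (s \ovee r) \geq (a \owedge s) \ovee r$.

$t \ovee (s \owedge r) \leq (t \ovee s) \owedge r$ and $t \owedge (s \ovee r) \geq (t \owedge s) \ovee r$.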
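
Two of these inequalities can be spot-checked numerically on the matrix side. The sketch below assumes the additive case with finite integer entries, reads the forward operation as the max-plus product and the dual operation as the min-plus product, and draws hypothetical random data; the mixed inequality is tested in the form "max-plus applied to a min-plus product is at most the min-plus of the max-plus product", which is an assumption about how the operations are interleaved.

```python
import random


def vmax_plus(v, M):
    """Row vector times matrix under max-plus."""
    return [max(v[i] + M[i][k] for i in range(len(v)))
            for k in range(len(M[0]))]


def vmin_plus(v, M):
    """Row vector times matrix under min-plus (the dual product)."""
    return [min(v[i] + M[i][k] for i in range(len(v)))
            for k in range(len(M[0]))]


def min_plus(A, B):
    """Matrix product under min-plus."""
    return [[min(A[i][j] + B[j][k] for j in range(len(B)))
             for k in range(len(B[0]))]
            for i in range(len(A))]


def leq(u, v):
    """Pointwise comparison u <= v."""
    return all(x <= y for x, y in zip(u, v))


random.seed(0)


def rand_vec(n):
    return [random.randint(-5, 5) for _ in range(n)]


def rand_mat(n):
    return [rand_vec(n) for _ in range(n)]


all_hold = True
for _ in range(200):
    a, b = rand_vec(3), rand_vec(3)
    S, R = rand_mat(3), rand_mat(3)
    meet = [min(x, y) for x, y in zip(a, b)]
    # (iii): (a ^ b) x S <= (a x S) ^ (b x S)
    rhs = [min(x, y) for x, y in zip(vmax_plus(a, S), vmax_plus(b, S))]
    ok_iii = leq(vmax_plus(meet, S), rhs)
    # mixed: a x (S x' R) <= (a x S) x' R
    ok_mixed = leq(vmax_plus(a, min_plus(S, R)),
                   vmin_plus(vmax_plus(a, S), R))
    all_hold = all_hold and ok_iii and ok_mixed
print(all_hold)
```

The mixed case is an instance of the general fact that a maximum of minima never exceeds the corresponding minimum of maxima.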




We remark that the above properties corresponding to the forward multiplications of an image by a template as defined in Chapter 1 are also valid, namely, $t \boxed{\vee} (a \wedge b) \leq (t \boxed{\vee} a) \wedge (t \boxed{\vee} b)$, etc.

3.1.4. Conjugacy

The notion of conjugacy as discussed in section 1.2 extends to templates as well. Suppose that $F$ and $F^*$ are conjugate. Then for $t \in (F^X)^Y$, $t^* \in ((F^*)^Y)^X$ is defined by

\[ t^*_x(y) = (t_y(x))^*. \]

The conjugate of $t \in (\mathbf{R}_{\pm\infty}^{X})^{Y}$ is the additive dual $t^*$, and the conjugate of $t \in ((\mathbf{R}^{+}_{\pm\infty})^{X})^{Y}$ is the multiplicative dual $\bar{t}$, both of which are defined in section 1.1.

Let $P$ be any set of $F$-valued templates from $Y$ to $X$, with $F$ and $F^*$ as conjugate systems. Define $P^*$ by

\[ P^* = \{ t^* : t \in P \}. \]

Here, the star symbol $*$ denotes the dual template for either value set $\mathbf{R}_{\pm\infty}$ or $\mathbf{R}^{+}_{\pm\infty}$. Note that $P^* \subset ((F^*)^Y)^X$. We have

Theorem 3.4. Let $(F, \vee)$ and $(F^*, \wedge)$ be conjugate. Then $((F^X)^Y, \vee, \boxed{\vee})$ and $(((F^*)^Y)^X, \wedge, \boxed{\wedge})$ are conjugate, where $F$ is a sub-bounded l-group of $\mathbf{R}_{\pm\infty}$, and $((F^X)^Y, \vee, \ovee)$ and $(((F^*)^Y)^X, \wedge, \owedge)$ are conjugate, where $F$ is a sub-bounded l-group of $\mathbf{R}^{+}_{\pm\infty}$, for any non-empty finite coordinate sets $X$, $Y$. In all cases the conjugate of a given template $t$ is the dual template $t^*$ or $\bar{t}$ of the respective bounded l-group as defined in Chapter 1.

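Proposition 3.5. If $(F, \vee, \times, \wedge, \times')$ is a self-conjugate belt, then $((F^X)^Y)^* = (F^Y)^X$ for all non-empty finite coordinate sets $X$, $Y$. Also, $((\mathbf{R}_{\pm\infty}^{X})^{X}, \vee, \boxed{\vee}, \wedge, \boxed{\wedge})$ is a self-conjugate belt, and $(((\mathbf{R}^{+}_{\pm\infty})^{X})^{X}, \vee, \ovee, \wedge, \owedge)$ is a self-conjugate belt.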




An  pviimple.  In  this  section  we  give  an  application  to  a  scheduling  problem,  showing  the 
use  of  the  conjugate  of  a  template.  In  particular,  this  example  provides  a  physical  interpreta¬ 
tion  of  the  conjugate  of  a  template. 

Suppose  we  have  n  tasks,  or  activities,  or  subroutines,  labelled  l,...,n.  Let  a{x{)  denote 
thv  starting  time  of  task  i,  and  assume  without  loss  of  generality  that  task  1  is  the  starting 
activity,  task  n  is  the  finishing  activity,  and  that  tasks  2  through  n-1  are  intermediate  activi¬ 
ties.  Suppose  we  are  given  the  time  of  the  starting  activity,  and  we  wish  to  know  the 
soonest  time  at  which  each  subsequent  activity  can  be  started.  In  particular,  what  is  the  ear¬ 
liest  time  that  task  n  can  start,  or,  what  is  the  earliest  expected  time  of  completion  of  the 
collection  of  tasks? 

The relation of the tasks to one another can be described by a partial order $\preceq$ on the set of tasks {1,...,n}:

$j \preceq i$ if and only if task j is to be completed before task i can start.

Let $d_{ij}$ denote the minimum amount of time by which the start of activity j must precede the start of activity i. That is, $d_{ij}$ is the duration time of activity j, or the processing time of task j, which must pass before activity i can start. Define $w \in (\mathbb{R}_{\pm\infty}^X)^X$ by

$$w_{x_i}(x_j) = \begin{cases} d_{ij} & \text{if } j \preceq i \\ -\infty & \text{otherwise.} \end{cases}$$

There is an obvious relationship between the weighted digraph associated with the partial order relation $\preceq$ and the template w. For example, suppose we have 5 tasks or activities, or subroutines of a program, which have the following relation or partial order:

(1,2)  (1,3)  (2,4)  (2,5)  (3,4)  (3,5)  (4,5) 


Here, activity 1 is the start activity, activity 5 is the end activity, and tasks 2, 3, 4 are intermediate tasks or subroutines. Suppose the duration times $d_{ij}$ of the activities are:

$d_{21} = 1$    $d_{31} = 6$    $d_{42} = 2$

$d_{43} = 1$    $d_{52} =$ [illegible in source]    $d_{53} = 3$    $d_{54} = 3$

and $d_{ii} = 0$ for each i = 1,...,5. This is consistent with a meaningful physical interpretation of the definition of duration time for a task.

The  corresponding  weighted  digraph  is  given  in  Figure  6. 


Figure  6.  A  Scheduling  Network. 

The  nodes  represent  the  activities,  and  the  duration  times  are  given  as  numbers  on  the 
directed  edges  linking  the  nodes. 

In determining $a(x_4)$, for example, note that $a(x_4)$ must satisfy

$$a(x_4) = \max\{\, d_{42} + a(x_2),\ d_{43} + a(x_3),\ d_{44} + a(x_4) \,\}$$

or equivalently

$$a(x_4) = \max_{j=1,\dots,5} \{\, w_{x_4}(x_j) + a(x_j) \,\}.$$

This last equality follows from the fact that $w_{x_i}(x_j) = -\infty$ if j is not related to i. In the general setting, we must solve, for each i = 1,...,n:

$$a(x_i) = \max_{j=1,\dots,n} \{\, w_{x_i}(x_j) + a(x_j) \,\}$$

or, writing the problem as an image algebra expression, we must solve for a in

$$a \boxed{\vee} w = a. \qquad (3\text{-}1)$$

Here, a is an image on X where |X| = n.

An  analysis  of  a  network  in  this  manner  is  called  backward  recursion  analysis. 
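The recursion in Equation (3-1) can be sketched numerically. The following is a minimal illustration, not the author's implementation: it stores the template w as a square matrix with `w[i][j]` standing for $w_{x_i}(x_j)$, fixes the start time of task 1 at 0, and iterates the max-plus product until the image stops changing. The function names are hypothetical, and the value used for $d_{52}$ is an assumption, since that entry is illegible in the source.

```python
NEG_INF = float("-inf")

def max_plus_apply(a, w):
    """Return the image b with b[i] = max_j (w[i][j] + a[j])."""
    n = len(a)
    return [max(w[i][j] + a[j] for j in range(n)) for i in range(n)]

def earliest_start_times(w, start_time=0):
    """Iterate a <- max-plus product of a and w until a fixed point is reached."""
    n = len(w)
    a = [NEG_INF] * n
    a[0] = start_time            # task 1 is the starting activity
    for _ in range(n):           # a longest path uses at most n - 1 edges
        nxt = max_plus_apply(a, w)
        if nxt == a:
            break
        a = nxt
    return a

# d[(i, j)]: minimum time by which the start of task j precedes the start of task i.
d = {(2, 1): 1, (3, 1): 6, (4, 2): 2, (4, 3): 1,
     (5, 2): 4,                  # assumed value; illegible in the source
     (5, 3): 3, (5, 4): 3}
n = 5
w = [[NEG_INF] * n for _ in range(n)]
for i in range(n):
    w[i][i] = 0                  # d_ii = 0
for (i, j), duration in d.items():
    w[i - 1][j - 1] = duration

earliest = earliest_start_times(w)
print(earliest)
```

With these durations the assumed value of $d_{52}$ does not affect the result, because the path through tasks 3 and 4 is longer.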

Under forward recursion, suppose we have n tasks with duration times $f_{ij}$, where $f_{ij}$ is the minimum amount of time by which the start of activity i must precede the start of activity j, if the activities are so related. Otherwise, let $f_{ij}$ have value $-\infty$. Define $w \in (\mathbb{R}_{\pm\infty}^X)^X$ by

$$w_{x_i}(x_j) = \begin{cases} f_{ij} & \text{if } j \succeq i \\ -\infty & \text{otherwise.} \end{cases}$$

As before, $f_{ii} = 0$ gives a consistent physical interpretation.

Let r be the planned completion date of the project, which is given, and define $a(x_i)$ to be the latest allowable starting time for activity i. We wish to determine $a(x_1), \dots, a(x_{n-1})$ such that $a(x_n) = r$. Thus, we desire to solve for a in

$$a(x_i) = \min_{j=1,\dots,n} \{\, -w_{x_i}(x_j) + a(x_j) \,\}$$

for i = 1,...,n. For example, for 5 nodes, suppose we have the following relations:

(1,2) (1,3) (2,4) (2,5) (3,4) (3,5) (4,5).

Here we write (i,j) if task i must precede task j. Suppose the times $f_{ij}$ of the activities are:

$f_{12} = 1$    $f_{13} = 6$    $f_{24} = 2$

$f_{34} = 1$    $f_{25} =$ [illegible in source]    $f_{35} =$ [illegible in source]    $f_{45} = 3$

Suppose we would like to find, say, $a(x_4)$ satisfying

$$a(x_4) = \min_{j=1,\dots,5} \{\, -w_{x_4}(x_j) + a(x_j) \,\}.$$

The value $-w_{x_4}(x_5) + a(x_5)$ is the latest allowable time to start task 5 minus the minimum amount of time activity 4 must precede activity 5, and the time to start task 4 must be at least as small as this number. Thus, the time to start task 4 must be at least as small as $-1 + a(x_5)$. The value $a(x_4) = \min_j \{ -w_{x_4}(x_j) + a(x_j) \} = -1 + r$. (All other values $-w_{x_4}(x_j) + a(x_j) = +\infty$, as $-w_{x_4}(x_j) = +\infty$ for $j \neq 5$.) Since r is given, this quantity can be explicitly determined. The remaining equations can be solved similarly.

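If we define $u \in (\mathbb{R}_{\pm\infty}^X)^X$ by

$$u_{x_i}(x_j) = \begin{cases} -f_{ij} & \text{if } j \succeq i \\ +\infty & \text{otherwise} \end{cases}$$

then it is obvious that in general we must solve for a the following:

$$a \boxed{\wedge} u = a. \qquad (3\text{-}2)$$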

It is clear that the template u in Equation (3-2) is the conjugate of the template w in Equation (3-1). That is,

$$u = w^*.$$

We can say that the templates w and w* define the structure of the network as we analyze it backward or forward in time, respectively.
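The relation $u = w^*$ can be checked on a small network. The sketch below is illustrative only: it assumes the template is stored as a square matrix with `w[i][j]` standing for $w_{x_i}(x_j)$, takes the conjugate to be the negated transpose, and solves Equation (3-2) by iterating the min-plus product from the given completion time r. The helper names, the value r = 10, and the weight on the edge from task 2 to task 5 are assumptions made for the example.

```python
INF = float("inf")
NEG_INF = float("-inf")

def conjugate(w):
    """Additive conjugate of a square template: u[i][j] = -w[j][i]."""
    n = len(w)
    return [[-w[j][i] for j in range(n)] for i in range(n)]

def latest_start_times(u, completion_time):
    """Iterate a <- min-plus product of a and u until a fixed point is reached."""
    n = len(u)
    a = [INF] * n
    a[n - 1] = completion_time   # task n must start at time r
    for _ in range(n):
        nxt = [min(u[i][j] + a[j] for j in range(n)) for i in range(n)]
        if nxt == a:
            break
        a = nxt
    return a

# Template w of Equation (3-1): w[i][j] is the time by which task j+1 precedes task i+1.
n = 5
w = [[NEG_INF] * n for _ in range(n)]
for i in range(n):
    w[i][i] = 0
for (i, j), duration in {(2, 1): 1, (3, 1): 6, (4, 2): 2, (4, 3): 1,
                         (5, 2): 4,   # assumed value; illegible in the source
                         (5, 3): 3, (5, 4): 3}.items():
    w[i - 1][j - 1] = duration

u = conjugate(w)
latest = latest_start_times(u, completion_time=10)
print(latest)
```

Tasks whose latest start equals their earliest start (here tasks 1, 3, 4 and 5 when r is the earliest completion time) lie on a critical path of the network.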


3.1.5. Alternating $tt^*$ and $t\bar{t}$ Products

This section discusses the concept of an alternating $tt^*$ or $t\bar{t}$ product of a template t and its conjugate under the operation $\boxed{\vee}$ or $\ovee$, respectively. We shall state the results only for the sub-bounded l-groups of $\mathbb{R}_{\pm\infty}$ and the operations $\boxed{\vee}$ and $\boxed{\wedge}$, with the understanding that unless otherwise stated, an arbitrary sub-bounded l-group of $\mathbb{R}^{\geq 0}_{\infty}$ and the operations $\ovee$ and $\owedge$ may be substituted in the appropriate places.




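Theorem 3.6. Let $F_{\pm\infty}$ be a sub-bounded l-group of $\mathbb{R}_{\pm\infty}$, where F denotes the group of the bounded l-group $F_{\pm\infty}$, and let $t \in (F_{\pm\infty}^X)^Y$. Then we have

$$t \boxed{\vee} (t^* \boxed{\wedge} t) = t \boxed{\wedge} (t^* \boxed{\vee} t) = (t \boxed{\vee} t^*) \boxed{\wedge} t = (t \boxed{\wedge} t^*) \boxed{\vee} t = t.$$

Similarly,

$$t^* \boxed{\vee} (t \boxed{\wedge} t^*) = t^* \boxed{\wedge} (t \boxed{\vee} t^*) = (t^* \boxed{\vee} t) \boxed{\wedge} t^* = (t^* \boxed{\wedge} t) \boxed{\vee} t^* = t^*.$$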
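The identities of Theorem 3.6 can be checked numerically in the matrix setting of [39]. The sketch below assumes a finite template is represented by an integer matrix, that the two template products correspond to max-plus and min-plus matrix products, and that the conjugate is the negated transpose. These representation choices and the function names are assumptions for illustration, not the thesis's notation.

```python
def max_plus(a, b):
    """Max-plus matrix product: c[i][j] = max_k (a[i][k] + b[k][j])."""
    return [[max(a[i][k] + b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def min_plus(a, b):
    """Min-plus matrix product: c[i][j] = min_k (a[i][k] + b[k][j])."""
    return [[min(a[i][k] + b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def conjugate(t):
    """Negated transpose: (t*)[j][i] = -t[i][j]."""
    return [[-t[i][j] for i in range(len(t))] for j in range(len(t[0]))]

t = [[3, -1, 0],
     [2, 5, -4]]                 # an arbitrary finite 2 x 3 example
ts = conjugate(t)

# The four bracketings of Theorem 3.6 that should all reduce to t.
products_t = [
    max_plus(t, min_plus(ts, t)),
    min_plus(t, max_plus(ts, t)),
    min_plus(max_plus(t, ts), t),
    max_plus(min_plus(t, ts), t),
]
# The corresponding bracketings that should reduce to t*.
products_ts = [
    max_plus(ts, min_plus(t, ts)),
    min_plus(ts, max_plus(t, ts)),
    min_plus(max_plus(ts, t), ts),
    max_plus(min_plus(ts, t), ts),
]
print(all(p == t for p in products_t), all(p == ts for p in products_ts))
```

Integer entries are used so that the comparisons are exact; with floating-point entries a tolerance would be needed.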

We now define an alternating tt* product. Write a word consisting of the letters t and t*, in an alternating sequence. A single letter t or t* is allowed. If we have k > 1 letters, now insert k-1 symbols $\boxed{\vee}$ and $\boxed{\wedge}$, in an alternating manner. For example, the following sequences are allowed:

t*  0  t 
t  0  t*  0  t 

t*  0  t  0  t*  0  t  0  t‘  0  t. 

Now  insert  brackets  in  an  arbitrary  way  so  that  the  resulting  expression  is  not  ambiguous. 
For  example, 

t*  0  t 

t  0  (t*  0  t) 

(t*  0  ((t  0  t*)  0  t))  0  (t*  0  t). 

Any  algebraic  expression  so  constructed  is  called  an  alternating  tt*  product. 

Suppose an alternating tt* product has an odd number of letters t and/or t*. Then we say it is of type t if it begins and ends with a t, and that it is of type t* if it begins and ends with a t*. If it has an even number of letters we say that it is of type

$t \boxed{\vee} t^*$ or $t \boxed{\wedge} t^*$ or $t^* \boxed{\vee} t$ or $t^* \boxed{\wedge} t$

exactly according to the first two letters with its separating operator, regardless of how the brackets lie in the entire expression. As an example:


t*  0  t 

is  of  type 

t*  0  t 

t  0  (t*  0  t) 

is  of  type 

t 

(t*  0  ((t  0  t*)  0  t))  0  (t*  0  t) 

is  of  type 

t*  0  t. 

Theorem 3.7. Let $F_{\pm\infty}$ be a sub-bounded l-group and t an arbitrary template in $(F_{\pm\infty}^X)^Y$. Then every alternating tt* product P is well-defined, and if P is of type Q, then P = Q.


If a product P has more than 1 letter, then we define P(z) to be the formal product obtained when the last (rightmost) letter, t or t* (or $\bar{t}$), is replaced by z, where z is an F-valued template on the appropriate coordinate sets X and Y.

Theorem 3.8. Let $F_{\pm\infty}$ be a sub-bounded l-group of $\mathbb{R}_{\pm\infty}$, and t, z arbitrary templates over F. If P is an alternating tt* product containing four letters and P is of type Q, then

P(z) = Q(z).

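3.2. Systems of Equations

We now discuss the problem of finding solutions to the problem:

Given $t \in (\mathbb{R}_{\pm\infty}^X)^Y$ and $b \in \mathbb{R}_{\pm\infty}^Y$, find $a \in \mathbb{R}_{\pm\infty}^X$ such that $a \boxed{\vee} t = b$. (3-3)

Similarly, we also wish to solve:

Given $t \in ((\mathbb{R}^{\geq 0}_{\infty})^X)^Y$ and $b \in (\mathbb{R}^{\geq 0}_{\infty})^Y$, find $a \in (\mathbb{R}^{\geq 0}_{\infty})^X$ such that $a \ovee t = b$.
Here, |X| = m, |Y| = n.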




3.2.1. F-asticity and /-Solutions

If F is a bounded l-group and x, y ∈ F, we say that the products $x \times y$ and $x \times' y$ are /-undefined if one of x, y is $-\infty$ and the other is $+\infty$. We say that a template product is /-undefined if the evaluation of $t_y(x)$ requires the formation of a /-undefined product of elements of the bounded l-group $F_{\pm\infty}$. Otherwise, we say that a template product is /-defined or /-exists. Some mathematical models require solutions which avoid the formation of /-undefined products, as in practical cases these often correspond to unrelated activities. We state these results for both bounded l-groups where appropriate, with the results in parentheses. As usual, the sub-bounded l-group $F_{\pm\infty}$ is dependent on which operation, $\boxed{\vee}$ or $\ovee$, is used.

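Lemma 3.9. Let $F_{\pm\infty}$ be a subbelt of $\mathbb{R}_{\pm\infty}$ ($\mathbb{R}^{\geq 0}_{\infty}$). Let X and Y be non-empty, finite arrays, and $t \in (F_{\pm\infty}^X)^Y$. Then the set of all images $a \in F_{\pm\infty}^X$ such that $a \boxed{\vee} t$ ($a \ovee t$) is /-defined is a sub-s-lattice of $F_{\pm\infty}^X$. Hence the set of solutions a of statement (3-3) such that $a \boxed{\vee} t$ ($a \ovee t$) /-exists is either empty or is a sub-s-lattice of $F_{\pm\infty}^X$.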

Lemma 3.10. Let X, Y, and W be non-empty, finite arrays, and $t \in (F_{\pm\infty}^X)^Y$. Then the set of templates $s \in (F_{\pm\infty}^W)^X$ such that $s \boxed{\vee} t$ ($s \ovee t$) is /-defined is a sub-s-lattice of $(F_{\pm\infty}^W)^X$.

Any solution a of statement (3-3) such that $a \boxed{\vee} t$ ($a \ovee t$) /-exists is called a /-solution of (3-3).

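Theorem 3.11. Let $F_{\pm\infty}$ be a sub-bounded l-group of $\mathbb{R}_{\pm\infty}$ ($\mathbb{R}^{\geq 0}_{\infty}$). Then (3-3) has at least one solution if and only if $a = b \boxed{\wedge} t^*$ ($a = b \owedge \bar{t}$) is a solution. In this case, $a = b \boxed{\wedge} t^*$ ($a = b \owedge \bar{t}$) is the greatest solution.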
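Theorem 3.11 gives a direct solubility test: form the candidate $b \boxed{\wedge} t^*$ and substitute it back into (3-3). The following sketch illustrates this for finite entries only. It assumes the template is stored as a matrix with `t[i][j]` standing for $t_{y_i}(x_j)$, so that $(a \boxed{\vee} t)(y_i) = \max_j (a(x_j) + t_{y_i}(x_j))$; the function names are hypothetical.

```python
def apply_template(a, t):
    """Return b with b[i] = max_j (a[j] + t[i][j])."""
    return [max(a[j] + row[j] for j in range(len(a))) for row in t]

def principal_solution(t, b):
    """Candidate greatest solution: a[j] = min_i (b[i] - t[i][j])."""
    return [min(b[i] - t[i][j] for i in range(len(t)))
            for j in range(len(t[0]))]

def solve(t, b):
    """Return the greatest solution of a (max-plus) t = b, or None if insoluble."""
    a = principal_solution(t, b)
    return a if apply_template(a, t) == b else None

t = [[2, 0],
     [3, 5]]
print(solve(t, [4, 7]))    # a soluble right-hand side
print(solve(t, [4, 10]))   # an insoluble right-hand side
```

The second call returns `None` because substituting the candidate back yields 9 rather than 10 in the second pixel, so by Theorem 3.11 no solution exists.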




Recall from probability theory that a row-stochastic matrix is a non-negative matrix in which the sum of the elements in each row is equal to 1. We will make analogous definitions, where the operation + is replaced by the operation $\vee$, and the unity element is $-\infty$.

Let $P \subseteq F_{\pm\infty}$, where $F_{\pm\infty}$ is an arbitrary sub-bounded l-group of $\mathbb{R}_{\pm\infty}$ ($\mathbb{R}^{\geq 0}_{\infty}$). A template $t \in (F_{\pm\infty}^X)^Y$ is called row-P-astic if $\bigvee_{j=1}^{m} t_{y_i}(x_j) \in P$ for all i = 1,...,n, and column-P-astic if $\bigvee_{i=1}^{n} t'_{x_j}(y_i) \in P$ for all j = 1,...,m. The template t is called doubly-P-astic if t is both row- and column-P-astic. Note that if t is column-P-astic, then t' is row-P-astic.

Theorem 3.12. Let $F_{\pm\infty}$ be a sub-bounded l-group of $\mathbb{R}_{\pm\infty}$ ($\mathbb{R}^{\geq 0}_{\infty}$) and $t \in (F_{\pm\infty}^X)^Y$, $b \in F_{\pm\infty}^Y$ be such that (3-3) is soluble. Then $a = b \boxed{\wedge} t^*$ ($a = b \owedge \bar{t}$) /-exists and is a /-solution of (3-3), if and only if one of the following cases is satisfied:

(i) $t \in (F^X)^Y$, and $b = +\infty$, the constant image with $+\infty$ everywhere.

(ii) $t \in (F^X)^Y$, and $b = -\infty$.

(iii) $t \in (F_{\pm\infty}^X)^Y$ is doubly F-astic, and $b \in F^Y$.

Moreover, every solution of (3-3) is then a /-solution, and $b \boxed{\wedge} t^*$ ($b \owedge \bar{t}$) is equal to $+\infty$, $-\infty$, or is finite, respectively, according as case (i), (ii), or (iii) holds.

In the following theorem, we state the dual and left-right generalizations of Theorems 3.11 and 3.12.

Corollary 3.13. Let $F_{\pm\infty}$ be a sub-bounded l-group of $\mathbb{R}_{\pm\infty}$ and let $t \in (F_{\pm\infty}^X)^Y$, $b \in F_{\pm\infty}^Y$. Then for all combinations of c, d, and δ given in Table 1, the following statement is true:

The image algebra equation c has at least one solution if and only if the product d is a solution; and the product d is then the δ solution. Furthermore, if the product d is /-defined, and equation c is /-defined when a = d, then equation c is /-defined when a is any solution of equation c. If $F_{\pm\infty}$ is a sub-bounded l-group of $\mathbb{R}^{\geq 0}_{\infty}$, the results in Table 1 hold for $\ovee$ replacing $\boxed{\vee}$ everywhere and $\bar{t}$ replacing $t^*$ everywhere.


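Table 1.

    c                               d                           δ

    $a \boxed{\vee} t = b$          $b \boxed{\wedge} t^*$      greatest
    $a \boxed{\vee} t^* = b$        $b \boxed{\wedge} t$        greatest
    $a \boxed{\wedge} t = b$        $b \boxed{\vee} t^*$        least
    $a \boxed{\wedge} t^* = b$      $b \boxed{\vee} t$          least
    $t \boxed{\vee} a = b$          $t^* \boxed{\wedge} b$      greatest
    $t^* \boxed{\vee} a = b$        $t \boxed{\wedge} b$        greatest
    $t \boxed{\wedge} a = b$        $t^* \boxed{\vee} b$        least
    $t^* \boxed{\wedge} a = b$      $t \boxed{\vee} b$          least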

If  d  is  a  solution  to  c  in  Table  1,  then  d  is  called  a  principal  solution. 

We  can  also  restate  the  last  three  theorems  as  a  solubility  criterion. 

Problem (3-3) is soluble if and only if $(b \boxed{\wedge} t^*) \boxed{\vee} t = b$ ($(b \owedge \bar{t}) \ovee t = b$); and every solution is a /-solution if $(b \boxed{\wedge} t^*) \boxed{\vee} t$ ($(b \owedge \bar{t}) \ovee t$) /-exists.

Note that Theorem 3.12 identifies the cases in which (3-3) has a /-defined /-solution. All solutions are then /-solutions. The next question to ask is: can we find all solutions? We now focus on the following problem.


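Given that $F_{\pm\infty}$ is a sub-bounded l-group of $\mathbb{R}_{\pm\infty}$ ($\mathbb{R}^{\geq 0}_{\infty}$) and that $(b \boxed{\wedge} t^*) \boxed{\vee} t$ [$(b \owedge \bar{t}) \ovee t$] /-exists and equals b, find all solutions of (3-3). (3-4)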

For cases (i) and (ii) of Theorem 3.12, we note that t is finite. The next proposition gives solutions for these two cases.

Proposition 3.14. Let $F_{\pm\infty}$ be a sub-bounded l-group of $\mathbb{R}_{\pm\infty}$ ($\mathbb{R}^{\geq 0}_{\infty}$). If $b = -\infty$ (the constant image), then Problem (3-4) has b as its unique solution. If $b = +\infty$, then Problem (3-4) has as its solutions exactly those images of $F_{\pm\infty}^X$ which have at least one pixel value equal to $+\infty$.

To determine solutions to case (iii), we need to consider the particular case that $F_{\pm\infty}$ is the 3-element bounded l-group $F_3$. Here b is finite with all elements having value $\phi$.

Lemma 3.15. Let $F_{\pm\infty}$ be the 3-element bounded l-group $F_3$. Let t be doubly F-astic and b be finite. Then (3-3) is soluble, having as principal /-solution a = 1 where $1(x_i) = \phi$ for all i. Hence, no solution to (3-3) contains $+\infty$ for any pixel value, and all solutions are /-solutions.

3.2.2. All Solutions to $a \boxed{\vee} t = b$ and $a \ovee t = b$

We now give some criteria for finding all solutions to problem (3-3) for the case where the template t is doubly F-astic and b is finite. We discuss the general case where F is the belt $\mathbb{R}$ or $\mathbb{R}^+$.

If a template $t \in (F_{\pm\infty}^X)^X$ has form

$$t_{x_i}(x_i) = \alpha_i, \quad \text{and} \quad t_{x_i}(x_j) = -\infty, \ j \neq i,$$

we write $t = \mathrm{diag}(\alpha_1, \alpha_2, \dots, \alpha_m)$.


For $b \in F^Y$ finite, define the template $d \in (F_{\pm\infty}^Y)^Y$ by

$$d = \mathrm{diag}([b(y_1)]^*, [b(y_2)]^*, \dots, [b(y_n)]^*).$$

Since b is finite, so is $d_{y_i}(y_i)$, and $d_{y_i}(y_i) = -b(y_i)$ (or $1/b(y_i)$) ∀ i = 1,...,n. Thus, solving (3-3) is equivalent to solving

$$a \boxed{\vee} s = 1, \qquad (3\text{-}5)$$

or

$$a \ovee s = 1,$$

where $s = d \boxed{\vee} t$ ($s = d \ovee t$) $\in (F_{\pm\infty}^X)^Y$, and $1 = \phi$, the constant image. Let

$$W'_j = \{\, (x_j, y_i) : s_{y_i}(x_j) = \bigvee_{k=1}^{n} s_{y_k}(x_j) \,\}.$$

Note that $W'_j \subset X \times Y$ for every j. The elements $s_{y_i}(x_j)$ corresponding to $(x_j, y_i) \in W'_j$ are called marked values of $W'_j$. Notice that every image will have at least one marked value, as d, t and s are doubly F-astic. Our next theorem gives conditions where there is no solution.

Lemma 3.16. Let $F_{\pm\infty}$ be a bounded l-group, $t \in (F_{\pm\infty}^X)^Y$ where t is doubly F-astic, and $b \in F^Y$. Define $s \in (F_{\pm\infty}^X)^Y$ by

$$s = d \boxed{\vee} t \quad (\text{or } s = d \ovee t)$$

depending on whether the group F is $\mathbb{R}$ or $\mathbb{R}^+$, respectively, and d is as above. Suppose there exists i such that for no j is $s_{y_i}(x_j)$ a marked value. That is, suppose there exists $y_i \in Y$ such that $s_{y_i}(x_j)$ is not a marked value for any j. Then there does not exist $a \in F^X$ such that $a \boxed{\vee} t = b$ ($a \ovee t = b$).

There now remains the case in which for every i, there is at least one j such that $s_{y_i}(x_j)$ is a marked value. We transform the question into a boolean problem, where it can be shown that the following procedure will give a set of solutions to equation (3-5) [39].

Step 1. For the bounded l-group $F_3$, define $g \in (F_3^X)^Y$ by

$$g_{y_i}(x_j) = \begin{cases} \phi & \text{if } s_{y_i}(x_j) \text{ is marked} \\ -\infty & \text{otherwise.} \end{cases}$$

Letting $f \in F_3^X$, now solve the boolean system

$$f \boxed{\vee} g = 1 \quad (\text{or } f \ovee g = 1). \qquad (3\text{-}6)$$

As in the case for matrices [39], each solution to equation (3-6) consists of an assignment of one of the values $-\infty$ or $\phi$ to each $f(x_j)$.

Let $f = (f(x_1), \dots, f(x_m))$ be a solution to equation (3-6).

Step 2. For each j = 1,...,m: if $f(x_j) = \phi$ then set $a(x_j)$ to be the value $-\left(\bigvee s'_{x_j}\right)$ ($1/\left(\bigvee s'_{x_j}\right)$). If $f(x_j) = -\infty$ then $a(x_j)$ is given an arbitrary value such that $a(x_j) < -\left(\bigvee s'_{x_j}\right)$ ($a(x_j) < 1/\left(\bigvee s'_{x_j}\right)$).

For the boolean case, we have

Proposition 3.17. The solutions of equation (3-6) are exactly the assignments of the values $\phi$ or $-\infty$ to the variables $f(x_j)$ such that for every i = 1,...,m there holds $f(x_j) = \phi$ for at least one j such that $s_{y_i}(x_j)$ is a marked value.

Theorem 3.18. Let $F_{\pm\infty}$ be a bounded l-group. Then the above two step procedure yields all solutions to equation (3-7) without repetition.
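The two-step procedure can be sketched for the additive case with finite entries. The code below is an illustration under stated assumptions rather than the procedure of [39] verbatim: the template is stored as a matrix with `t[i][j]` standing for $t_{y_i}(x_j)$, $\phi$ is taken to be 0, a boolean `True` stands for the value $\phi$ and `False` for $-\infty$, and all function names are hypothetical.

```python
from itertools import product

def marked_values(t, b):
    """Return (col_max, marked) for the scaled template s[i][j] = t[i][j] - b[i]."""
    n, m = len(t), len(t[0])
    s = [[t[i][j] - b[i] for j in range(m)] for i in range(n)]
    col_max = [max(s[i][j] for i in range(n)) for j in range(m)]
    marked = [[s[i][j] == col_max[j] for j in range(m)] for i in range(n)]
    return col_max, marked

def boolean_solutions(marked):
    """Step 1: all assignments f such that every row i has a marked
    value in some column j with f[j] equal to phi (True)."""
    m = len(marked[0])
    return [f for f in product([False, True], repeat=m)
            if all(any(row[j] and f[j] for j in range(m)) for row in marked)]

t = [[2, 0],
     [3, 5]]
b = [4, 7]
col_max, marked = marked_values(t, b)

# Step 2: where f[j] is phi, a[j] is forced to -col_max[j];
# where f[j] is -inf, a[j] may be any value strictly below -col_max[j].
principal = [-c for c in col_max]
solutions = boolean_solutions(marked)
print(principal, solutions)
```

In this example the only boolean solution assigns $\phi$ to every variable, so the principal solution is the unique solution. If some row had no marked value, `boolean_solutions` would return an empty list, in agreement with Lemma 3.16.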

3.2.3. Existence and Uniqueness

This section discusses some existence and uniqueness theorems concerning solutions to Problem (3-3).


Theorem 3.19. Let $F_{\pm\infty}$ be a bounded l-group, and let $t \in (F_{\pm\infty}^X)^Y$ be doubly F-astic and $b \in F^Y$ be finite. Then a necessary and sufficient condition that the equation $a \boxed{\vee} t = b$ ($a \ovee t = b$) shall have at least one solution is that for all $y_i \in Y$, there exists at least one j such that for the template $s = d \boxed{\vee} t$ ($s = d \ovee t$), where d is as defined above,

$s_{y_i}(x_j)$ is a marked value.

We remark that the solution $a(x_j) = -\left(\bigvee s'_{x_j}\right)$ ($1/\left(\bigvee s'_{x_j}\right)$) gives exactly the principal solution.

This  is  equivalent  to 

Theorem 3.20. Let $F_{\pm\infty}$ be a bounded l-group, let $t \in (F_{\pm\infty}^X)^Y$ be doubly F-astic, and let $b \in F^Y$ be finite. Then a necessary and sufficient condition that the equation $a \boxed{\vee} t = b$ ($a \ovee t = b$) shall have exactly one solution is that for all $y_i \in Y$, there exists at least one j such that

$s_{y_i}(x_j)$ is a marked value,

and  for  each  j  =  l,...,n,  there  exists  an  i,  1  <  i  <  m  such  that  I  W'||  =  1. 

Define a template $t \in (F_{\pm\infty}^X)^Y$ to be strictly doubly $\phi$-astic if it satisfies the following two conditions.

(i) $t_{y_i}(x_j) \leq \phi$, i,j = 1,...,n

(ii) for each i = 1,...,n, there exists a unique index j ∈ {1,2,...,n} such that $t_{y_i}(x_j)$ has value $\phi$.

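If $t \in (F_{\pm\infty}^X)^Y$, |X| = m, |Y| = n, then we say that t contains a template $s \in (F_{\pm\infty}^{W_1})^{W_2}$ if the matrix corresponding to t contains the matrix corresponding to s, of size h x k, where $|W_2| = h$, $|W_1| = k$, and both h, k ≤ min(m,n). We say that a template $t \in (F_{\pm\infty}^X)^Y$ contains an image $a \in F_{\pm\infty}^X$ if $a = t_y$ for some y ∈ Y.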


Theorem 3.21. Let $F_{\pm\infty}$ be a bounded l-group, let $t \in (F_{\pm\infty}^X)^Y$ be doubly F-astic, and let $b \in F^Y$ be finite. Then a necessary and sufficient condition that the equation $a \boxed{\vee} t = b$ ($a \ovee t = b$) shall have exactly one solution is that we can find k finite elements $\alpha_1, \dots, \alpha_k$ such that the template d defined by

**yi(*i)  =*  **y,(*j)  =  *  "j 

is doubly $\phi$-astic and that d contains a strictly doubly $\phi$-astic template $s \in (F_{\pm\infty}^W)^W$, |W| = k.


3.2.4. A Linear Programming Criterion


Since one of our interests is the case where the bounded l-group is $\mathbb{R}_{\pm\infty}$, we now show that the problem can be stated as a linear programming problem for this bounded l-group.

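Theorem 3.22. Let $t \in (\mathbb{R}_{\pm\infty}^X)^Y$ be doubly F-astic and $b \in F^Y$ be finite. Let I be the set of index pairs (i,j) such that $t_{y_i}(x_j)$ is finite, 1 ≤ i ≤ n, 1 ≤ j ≤ m. Then a sufficient condition that the equation $a \boxed{\vee} t = b$ be soluble is that some solution $\{ z_{ij} \mid (i,j) \in I \}$ of the following optimization problem in the variables $z_{ij}$, for $(i,j) \in I$:

$$\text{Minimize} \quad \sum_{(i,j) \in I} \left( b(y_i) - t_{y_i}(x_j) \right) z_{ij}$$

$$\text{Subject to} \quad \sum_{i : (i,j) \in I} z_{ij} = 1, \quad j = 1,\dots,m$$

$$\text{and} \quad z_{ij} \geq 0, \quad (i,j) \in I$$

shall also satisfy:

$$\sum_{j : (i,j) \in I} z_{ij} > 0, \quad i = 1,\dots,n.$$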


We now make a definition which will be used in the next section. Let $F_{\pm\infty}$ be a belt, and let $t \in (F_{\pm\infty}^X)^Y$ be arbitrary. The right column space of t is the set of all $b \in F_{\pm\infty}^Y$ for which the equation

$$a \boxed{\vee} t = b \quad (\text{or } a \ovee t = b)$$

is soluble for a.


Linear Dependence

Linear dependence over a bounded l-group. We can consider the equation $a \boxed{\vee} t = b$ (or $a \ovee t = b$) in another way. For the images $\{t'_{x_j}\}_{j=1}^{m}$, rewrite $a \boxed{\vee} t = b$ as

$$\bigvee_{j=1}^{m} \left[ t'_{x_j} \boxed{\vee} a(x_j) \right] = b, \qquad (3\text{-}7)$$

where $a(x_j) \in (F_{\pm\infty}^Y)^Y$ is the one-point template with target pixel value of $a(x_j)$. In this case, we say that b is a linear combination of $\{t'_{x_j}\}_{j=1}^{m}$, or that $b \in F_{\pm\infty}^Y$ is (right) linearly dependent on the set $\{t'_{x_j}\}_{j=1}^{m}$. We can make analogous definitions for the case of $\ovee$. While in linear algebra the concept of linear dependence provides a foundation for a theory of rank and dimension, the situation in the minimax algebra is more complicated. The notion of strong linear independence is introduced to give us a similar construct.

Theorem 3.23. Let $F_{\pm\infty}$ be a bounded l-group other than $F_3$. Let X be a coordinate set such that |X| > 2, and k > 1 be an arbitrary integer. Then we can always find k finite images on X, no one of which is linearly dependent on the others.

If $F_{\pm\infty} = F_3$, then we can produce a dimensional anomaly.

Theorem 3.24. Suppose $F_{\pm\infty} = F_3$, and let X be a coordinate set such that |X| = m > 2. Then we can always find at least $(m^2 - m)$ images on X, no one of which is linearly dependent on the others.

Since every bounded l-group contains a copy of $F_3$, the dimensional anomaly in Theorem 3.24 extends to any arbitrary bounded l-group.




Let |X| = m, |Y| = n, and $t \in (F_{\pm\infty}^X)^Y$, where F is an arbitrary bounded l-group. We would like to define the rank of t in terms of linear independence, and to be equal to the number of linearly independent images of t. Suppose we were to define linear independence as the negation of linear dependence, that is, a set of k images on X $(a_1, \dots, a_k)$ is linearly independent if and only if no one of the $a_i$ is linearly dependent on any subset of the others. Then applying Theorem 3.23 for |X| = n and k > n, we could find k finite images which are linearly independent. If we defined rank as the number of linearly independent images $t_y$ of t, then every template would have rank k > n, which is not a useful definition in this context.

Strong linear independence. As for the matrix algebra, we define the concept of strong linear independence [39].

Let $F_{\pm\infty}$ be a bounded l-group and let $a(1), \dots, a(k) \in F_{\pm\infty}^X$, k ≥ 1. We say that the set {a(1),...,a(k)} is strongly linearly independent, or simply SLI, if there is at least one finite image $b \in F^X$ which has a unique expression of the form

$$b = \bigvee_{p=1}^{h} a(j_p) \boxed{\vee} \lambda_{j_p} \quad \left(\text{or } b = \bigvee_{p=1}^{h} a(j_p) \ovee \lambda_{j_p}\right) \qquad (3\text{-}8)$$

with $\lambda_{j_p} \in F$, p = 1,...,h, $1 \leq j_p \leq k$, p = 1,...,h, and $j_p < j_q$ if p < q.

If $A = \{a_1, a_2, \dots, a_k\}$ is a set of k images where each $a_i \in F_{\pm\infty}^Y$, |Y| = n, then we define the template based on the set A in the following way. For the integer k, we find a coordinate set W which has k pixel locations, that is, |W| = k. To this end, choose a positive integer p such that k = p · q + r, where r < p (by the division algorithm for integers). Let W denote the set {(i,j) : 0 ≤ i ≤ p-1, 0 ≤ j ≤ q-1} ∪ {(-1,j) : 0 ≤ j ≤ r-1}, which is a subset of $\mathbb{Z}^2$ that is almost rectangular. There is an additional row in the fourth quadrant corresponding to the r left-over pixel locations that don't quite make a full row. Of course, there are other selections that can be made for W. Define the template t based on A by $t \in (F_{\pm\infty}^W)^Y$,

$$t'_{w_i} = a_i, \quad i = 1,\dots,k.$$

To clarify notation, we will denote the template based on the set $A = \{a_1, a_2, \dots, a_k\}$ by t = B(A). Hence, if $t \in (F_{\pm\infty}^X)^Y$, then for $A = \{t'_x\}_{x \in X}$, we have B(A) = t. If $P = \{a_1, a_2, \dots, a_h\}$ is a set of h F-valued images on X, we denote the right column space of B(P) by $\langle a_1, a_2, \dots, a_h \rangle$. Thus, for $t \in (F_{\pm\infty}^X)^Y$, $\langle \{t'_x\}_{x \in X} \rangle$ is the right column space of t. The set $\langle a_1, a_2, \dots, a_h \rangle$ is also called the space generated by the set $\{a_1, a_2, \dots, a_h\}$.

Lemma 3.25. Let $F_{\pm\infty}$ be a bounded l-group with group F. Let $c_1, \dots, c_k, b \in F_{\pm\infty}^X$, k ≥ 1, be such that b is finite and has a unique expression of the form (3-8). Then h = k; $j_1 = 1, \dots, j_h = k$; $\lambda_{j_p} \in F$, p = 1,...,h; and t is doubly F-astic, where $t \in (F_{\pm\infty}^Y)^X$ is the template based on the set $C = \{c_1, \dots, c_k\}$. Here, |Y| = k.

We also have

Corollary 3.26. Let $F_{\pm\infty}$ be a bounded l-group and let $c_1, \dots, c_n \in F_{\pm\infty}^X$ for an integer n ≥ 1. Then $\{c_1, \dots, c_n\}$ is SLI if and only if there exists a finite image $b \in F^X$ such that the equation $a \boxed{\vee} t = b$ ($a \ovee t = b$) is uniquely soluble for a, where $t \in (F_{\pm\infty}^Y)^X$ is the template based on the set $C = \{c_1, \dots, c_n\}$, t = B(C), |Y| = n.

We can now define linear independence. Let $F_{\pm\infty}$ be a given belt. Then linear independence is the negation of linear dependence: $c_1, \dots, c_n \in F_{\pm\infty}^X$ are linearly independent when no one of them is linearly dependent on the others. How is linear dependence related to strong linear independence?


Theorem 3.27. Let $F_{\pm\infty}$ be a bounded l-group, and $c_1, \dots, c_k \in F_{\pm\infty}^X$. For $c_1, \dots, c_k$ to be linearly independent it is sufficient, but not necessary, that $c_1, \dots, c_k$ be SLI.

We may call the above definition of SLI right SLI. If, in the definition of SLI, we were to multiply by the scalars $\lambda_j$ from the left, we define the concept of left SLI. If formula (3-8) is replaced by

$$b = \bigwedge_{p=1}^{h} a(j_p) \boxed{\wedge} \lambda_{j_p} \quad \left(\text{or } b = \bigwedge_{p=1}^{h} a(j_p) \owedge \lambda_{j_p}\right)$$

then we have the concept of right dual SLI. We define in an analogous way the concept of left dual SLI.


3.3. Rank of Templates

Template rank over a bounded l-group. Let $F_{\pm\infty}$ be a bounded l-group and $t \in (F_{\pm\infty}^X)^Y$ be arbitrary. We call the template t (right) or left column regular if the set of images $\{t'_x\}_{x \in X}$ is (right) or left SLI, respectively. We say t is right or left row regular if the template t' is right or left column regular, respectively.

Now suppose that $F_{\pm\infty}$ is a bounded l-group and $t \in (F_{\pm\infty}^X)^Y$. Suppose r is the maximum number of images of t that are SLI. In this case we say that t has column rank equal to r. The row rank of t is the column rank of t'. For a template $t \in (F_{\pm\infty}^X)^Y$, we say that t has $\phi$-astic rank equal to $r \in \mathbb{Z}^+$ if the following is true for k = r but not for k > r:

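Let W be a coordinate set, |W| = k ≤ min(m,n). There exist $a \in F^X$ and $b \in F^Y$, both finite, such that the template $s \in (F_{\pm\infty}^X)^Y$ is doubly $\phi$-astic and s contains a strictly doubly $\phi$-astic template $u \in (F_{\pm\infty}^W)^W$, where

$$s_{y_i}(x_j) = b(y_i) + t_{y_i}(x_j) + a(x_j), \quad \forall\ i = 1,\dots,n \text{ and } j = 1,\dots,m$$

if F = $\mathbb{R}$, and

$$s_{y_i}(x_j) = b(y_i) * t_{y_i}(x_j) * a(x_j), \quad \forall\ i = 1,\dots,n \text{ and } j = 1,\dots,m$$

if F = $\mathbb{R}^+$.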

Lemma 3.28. Let $F_{\pm\infty}$ be a bounded l-group with group $F \in \{\mathbb{R}, \mathbb{R}^+\}$, and suppose that $t \in (F_{\pm\infty}^X)^Y$ has $\phi$-astic rank equal to r. Then t is doubly F-astic and t' contains a set of at least r images, $t'_{x_{j_k}}$, k = 1,...,r, which are SLI.

Lemma 3.29. Let $F \in \{\mathbb{R}, \mathbb{R}^+\}$, and suppose that $t \in (F_{\pm\infty}^X)^Y$ is doubly F-astic and consists of a set of r images which are SLI. Then t has $\phi$-astic rank equal to at least r.

Accordingly,  we  have 

Theorem 3.30. Let $F \in \{\mathbb{R}, \mathbb{R}^+\}$, and suppose that $t \in (F_{\pm\infty}^X)^Y$ is doubly F-astic. Then the following statements are all equivalent:

(i) t has $\phi$-astic rank equal to r

(ii)  t  has  right  column  rank  equal  to  r 

(iii)  t  has  left  row  rank  equal  to  r 

(iv)  t*  has  dual  right  column  rank  equal  to  r 

(v)  t*  has  dual  left  row  rank  equal  to  r  . 

If t is doubly F-astic, then we can apply Theorem 3.30 and simply use the term rank of t, for ranks (i) to (iii), and the term dual rank of t for ranks (iv) and (v). If the bounded l-group $F_{\pm\infty}$ is commutative, as in both our cases, we have the following

Corollary 3.31. Let $F \in \{\mathbb{R}, \mathbb{R}^+\}$, and let $t \in (F_{\pm\infty}^X)^Y$ be doubly F-astic. Then the following statements are all equivalent:

(i)  t  has  left  column  rank  equal  to  r 

(ii)  t  has  right  row  rank  equal  to  r 

(iii)  t*  has  dual  left  column  rank  equal  to  r 

(iv)  t*  has  dual  right  row  rank  equal  to  r  . 




3.3.1. Existence of Rank and Relation to SLI

We now discuss the existence of the rank of a template and the relationship of rank to SLI.

Theorem 3.32. Let $F \in \{\mathbb{R}, \mathbb{R}^+\}$, and let $t \in (F_{\pm\infty}^X)^Y$. Then there is an integer r such that t has $\phi$-astic rank r, if and only if t is doubly F-astic. In this case, r satisfies 1 ≤ r ≤ min(m,n), where m = |X|, n = |Y|.

We now have the tools to show that the previous dimension anomalies are avoided in the context of strong linear independence.

Theorem 3.33. Let $F \in \{\mathbb{R}, \mathbb{R}^+\}$, and let X be an arbitrary non-empty, finite coordinate set with |X| = m. Then for each integer n, 1 ≤ n ≤ m, we can find n images on X, $a_j \in F^X$, j = 1,...,n, which are SLI. This is impossible for n > m.

Permanents and Inverses

As in linear algebra, if t is a matrix all of whose eigenvalues satisfy $|\lambda| < 1$, then the expression

$$(e - t)^{-1} = e + t + t^2 + \cdots$$

is valid. We state an analogous case in the image algebra.

For a bounded l-group $F_{\pm\infty}$, a template $t \in (F_{\pm\infty}^X)^X$ is called increasing if

$a \boxed{\vee} t \geq a$ for all $a \in F_{\pm\infty}^X$, and $s \boxed{\vee} t \geq s$ for all $s \in (F_{\pm\infty}^Y)^X$,

where Y is any arbitrary coordinate set.

We have

Lemma 3.34. Let $F_{\pm\infty}$ be a bounded l-group, and let $t \in (F_{\pm\infty}^X)^X$. Then t is increasing if and only if $t_x(x) \geq \phi$ ∀ x ∈ X.


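Let $t \in (\mathbb{R}_{\pm\infty}^X)^X$ be a template, |X| = m. We define the permanent of t to be the scalar $\mathrm{Perm}(t) \in \mathbb{R}_{\pm\infty}$ given by

$$\mathrm{Perm}(t) = \bigvee_{\sigma \in S_m} \left( \sum_{i=1}^{m} t_{x_i}(x_{\sigma(i)}) \right),$$

where the maximum is taken over all permutations $\sigma$ in the symmetric group $S_m$ of order m!.

For the bounded l-group $\mathbb{R}^{\geq 0}_{\infty}$, let $t \in ((\mathbb{R}^{\geq 0}_{\infty})^X)^X$ be a template, |X| = m. We define the permanent of t to be the scalar $\mathrm{Perm}(t) \in \mathbb{R}^{\geq 0}_{\infty}$ given by

$$\mathrm{Perm}(t) = \bigvee_{\sigma \in S_m} \left( \prod_{i=1}^{m} t_{x_i}(x_{\sigma(i)}) \right),$$

where again the maximum is taken over all permutations $\sigma$ in the symmetric group $S_m$.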
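For small m the two permanents can be evaluated directly from the definition by enumerating permutations. The sketch below assumes the template is stored as a square matrix with `t[i][j]` standing for $t_{x_i}(x_j)$; the function names are hypothetical, and the brute-force enumeration is chosen for clarity rather than efficiency, since it visits all m! permutations.

```python
from itertools import permutations
from math import prod

def permanent_additive(t):
    """Max over permutations sigma of sum_i t[i][sigma(i)]."""
    m = len(t)
    return max(sum(t[i][sigma[i]] for i in range(m))
               for sigma in permutations(range(m)))

def permanent_multiplicative(t):
    """Max over permutations sigma of prod_i t[i][sigma(i)], for non-negative entries."""
    m = len(t)
    return max(prod(t[i][sigma[i]] for i in range(m))
               for sigma in permutations(range(m)))

print(permanent_additive([[1, 2], [3, 0]]))        # max(1 + 0, 2 + 3)
print(permanent_multiplicative([[1, 2], [3, 4]]))  # max(1 * 4, 2 * 3)
```

In the additive case this is the value of an optimal assignment problem, so for larger m an assignment algorithm would be the natural replacement for the enumeration.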

The adjugate template of $t \in (F_{\pm\infty}^X)^X$ is the template Adj(t) defined by

$$[\mathrm{Adj}(t)]_{y_i}(x_j) = \mathrm{Cofactor}[t]_{x_j}(y_i)$$

where $\mathrm{Cofactor}[t]_{x_j}(y_i)$ is the permanent of the template s obtained from t by restricting to the indices

h = 1,...,j-1,j+1,...,m and k = 1,...,i-1,i+1,...,m.

Here, $s \in (F_{\pm\infty}^W)^W$, where |W| = m-1. For m = 1, we define Adj(t) = $\phi$.

Graph Theory

We now present some graph theoretic tools which will be used later.

A digraph or directed graph is a pair D = {V,E} where V is a finite set of vertices {1,...,n} and E ⊂ V x V. The set E is called the set of edges of D. An edge (i,j) is directed from i to j, and can be represented by a vector with tail at node i and head at node j.

A graph is a pair G = {V,E} where V is a finite set of vertices {1,...,n} and E ⊂ V x V such that (i,j) ∈ E if and only if (j,i) ∈ E.




A u-v path in a digraph or graph is a finite sequence of vertices $u = x_0, x_1, \dots, x_m = v$ such that $(x_j, x_{j+1}) \in E$ for all $j = 0,\dots,m-1$. A circuit is a path with the property that $x_0 = x_m$. A simple path $x_0, x_1, \dots, x_m$ is a path with distinct vertices except possibly for $x_0$ and $x_m$. A simple circuit is a circuit which is a simple path.

A weighted digraph (graph) is a digraph (graph) to which every edge (i,j) is uniquely assigned a value in $F_{\pm\infty}$. We denote the weight of the edge (i,j) by t(i,j) or $t_{ij}$. Note that the value $t_{ij}$ is not necessarily equal to the value $t_{ji}$.

We remark that if G = {V,E} is a graph then if there exists a u-v path, there exists a v-u path.

With each path (circuit) $\sigma = x_0, x_1, \dots, x_m$ of a weighted graph G, there is an associated path (circuit) product $p(\sigma)$, defined by

$$p(\sigma) = t_{x_0 x_1} \times t_{x_1 x_2} \times \cdots \times t_{x_{m-1} x_m}.$$

For each template $t \in (F_{-\infty}^X)^X$ where $|X| = n$, we can associate a weighted graph A(t) in the following way. The associated graph A(t) is the weighted graph G = (V,E), where V = {1,2,...,n}, and whose weights are $t_{x_j}(x_i)$, for the pair (i,j) such that $x_i \in S_{-\infty}(t_{x_j})$. The pair (i,j) is then considered an edge. If $t_{x_j}(x_i) = -\infty$, then we can extend E to all of $V \times V$ by stating that $(i,j) \in E$ with null weight $-\infty$. An example of a template t and its associated weighted graph A(t) is given below in Figure 7. We have omitted listing the values of $-\infty$ on A(t). Here, $|X| = 3$.




Figure 7. A Template and its Associated Graph. (a) A Template t; (b) Associated Graph A(t).


For the belt $F_{-\infty}$, the correspondence is one-one. We note this in the next lemma.

Lemma 2.48. Let $F_{-\infty}$ be a belt, where $+\infty \notin F$. Let $\alpha : (F_{-\infty}^X)^X \to \{\, G : G$ is a weighted graph with n nodes $\}$ be defined by $\alpha(t) = A(t)$. Then $\alpha$ is one-one and onto.

Proof: Suppose $\alpha(t) = \alpha(s)$. Let $\{ t_{x_j}(x_i) \}$ be the weights for A(t) and $\{ s_{x_j}(x_i) \}$ be the weights for A(s). By definition, $t_{x_j}(x_i) = s_{x_j}(x_i)$ for all i,j, and hence t = s.




Now suppose that G = (V,E) is a weighted graph with weights $\{ w_{ij} \}$. Define $t \in (F_{-\infty}^X)^X$ by $t_{x_j}(x_i) = w_{ij}$ if $(i,j) \in E$, and $t_{x_j}(x_i) = -\infty$ otherwise. Then $\alpha(t) = G$.

Q.E.D.

Let $t \in (F_{-\infty}^X)^X$. If for each circuit $\sigma$ in A(t) we have $p(\sigma) \le \phi$ and there exists at least one circuit $\sigma$ such that $p(\sigma) = \phi$, then we call t a definite template.

Lemma 3.35. A template $t \in (F_{\pm\infty}^X)^X$ is definite if and only if for all simple circuits $\sigma \in A(t)$, $p(\sigma) \le \phi$ and there exists at least one such simple circuit $\sigma$ such that $p(\sigma) = \phi$.

Theorem 3.36. Let $t \in (F_{\pm\infty}^X)^X$ be either row-$\phi$-astic or column-$\phi$-astic. Then t is definite.

Theorem 3.37. Let $t \in (F_{\pm\infty}^X)^X$. If t is definite then so is $t^r$ for any integer $r > 0$.

Let $t \in (F_{-\infty}^X)^X$ where $|X| = n$. The metric template generated by t is

$$\Gamma(t) = t \vee t^2 \vee \cdots \vee t^n.$$

The dual metric template is

$$\Gamma^*(t) = t^* \wedge (t^2)^* \wedge \cdots \wedge (t^n)^*.$$

The name metric originates from the application of the minimax algebra to transportation networks. If for the bounded l-group $R_{\pm\infty}$ the value $t_{x_j}(x_i)$ represents the direct distance from node i to node j of a transportation network with $t_{x_j}(x_i) = +\infty$ if there is no direct route, then $(\Gamma(t))^*$ represents the shortest distance matrix, that is, $((\Gamma(t))^*)_{x_j}(x_i)$ is the shortest path possible from node i to node j of all possible paths. A description of a transportation problem concerning shortest paths is discussed in Cuninghame-Green's book [39].
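The shortest-distance reading of the metric template can be sketched numerically. The code below is an illustration only: it stores a template as a list of lists with `m[i][j]` holding the direct distance from node i to node j (a hypothetical convention), forms the powers with either max or min as the lattice operation, and does not reproduce the conjugation convention of the text. The network distances are invented for the example.

```python
NEG_INF = float("-inf")
POS_INF = float("inf")

def product(a, b, pick):
    """Lattice matrix product: pick over k of a[i][k] + b[k][j]."""
    n = len(a)
    return [[pick(a[i][k] + b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def metric(t, pick):
    """t (pick) t^2 (pick) ... (pick) t^n, with pick = max or pick = min."""
    n = len(t)
    power, total = t, t
    for _ in range(n - 1):
        power = product(power, t, pick)
        total = [[pick(total[i][j], power[i][j]) for j in range(n)]
                 for i in range(n)]
    return total

# Hypothetical direct distances; +inf means "no direct route".
dist = [[POS_INF, 1, 5],
        [POS_INF, POS_INF, 2],
        [1, POS_INF, POS_INF]]

shortest = metric(dist, min)   # min-plus form: shortest routes of 1..n edges
print(shortest)                # [[4, 1, 3], [3, 4, 2], [1, 2, 4]]

# The max-plus metric of the negated weights is the negation of the above.
negated = [[-v for v in row] for row in dist]
assert metric(negated, max) == [[-v for v in row] for row in shortest]
```

The diagonal entries are the lengths of the shortest circuits through each node rather than zero, because the join starts at the first power of t and does not include the identity template.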




Theorems  3.38  through  3.40  are  used  to  prove  Theorem  3.41. 

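Lemma 3.38. Let $t \in (F_{\pm\infty}^X)^X$, where $|X| = n$. Then

$$\Gamma(t) = (1 \vee t)^{n-1} \boxed{\vee} t.$$

Lemma 3.39. $(t \vee 1)^{n-1} = 1 \vee t \vee \cdots \vee t^{n-1}$, $t \in (F_{\pm\infty}^X)^X$.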

Theorem 3.40. Let $t \in (F_{\pm\infty}^X)^X$ be definite. Then

$$t^r \le \Gamma(t), \quad r = 1,2,\dots$$

Theorem 3.41. Let $t \in (F_{\pm\infty}^X)^X$ be definite. Then

$$[\Gamma(t)]^r \le \Gamma(t), \quad r = 1,2,\dots$$
$$\Gamma(t) = (1 \vee t)^r \boxed{\vee} t, \quad r = 1,2,\dots,n-1.$$

Using  the  Adjugate  of  a  template,  we  have 

Theorem 3.42 (53). Let $F_{\pm\infty}$ be a commutative bounded l-group and $t \in (F_{\pm\infty}^X)^X$ be definite and increasing. Then $\mathrm{Adj}(t) = \Gamma(t)$.

Now we define the inverse of a template. For $t \in (F_{\pm\infty}^X)^X$, we define

$\mathrm{Inv}(t) = (\mathrm{Perm}(t))^{-1} \boxed{\vee} \mathrm{Adj}(t)$ (or $\mathrm{Inv}(t) = (\mathrm{Perm}(t))^{-1} \ovee \mathrm{Adj}(t)$)
by direct analogy with elementary linear algebra.

We note that the template Inv(t) is not necessarily an inverse in the sense that $\mathrm{Inv}(t) \boxed{\vee} t = 1$, for example.

3.3.4. Invertibility

In order to define an invertible template, that is, a template $t \in (F_{\pm\infty}^X)^X$ that has the property that there exists a unique template s satisfying $t \boxed{\vee} s = s \boxed{\vee} t = 1$ ($t \ovee s = s \ovee t = 1$), we need to introduce the concept of equivalent templates.




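Let $F_{\pm\infty}$ be a subbelt of $R_{\pm\infty}$ or $R^{+}_{\pm\infty}$. A template $p \in (F_{\pm\infty}^X)^X$ is said to be invertible if there exists a template $q \in (F_{\pm\infty}^X)^X$ such that $p \boxed{\vee} q = q \boxed{\vee} p = 1$ ($p \ovee q = q \ovee p = 1$).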

These templates can be described in close detail. Let us define a strictly doubly F-astic template over a bounded l-group $F_{\pm\infty}$ to be an element t of $(F_{\pm\infty}^X)^X$ satisfying

(i) $t_{x_j}(x_i) < +\infty$, i,j = 1,...,n

(ii) for each index i there exists a unique index $j_i \in \{1,2,\dots,m\}$ such that $t_{x_i}(x_{j_i})$ is finite.

Theorem 3.43. Let $F_{\pm\infty}$ be a bounded l-group with group F and let $p \in (F_{\pm\infty}^X)^X$ be given. Then p is invertible if and only if p is strictly doubly F-astic.

As is usual, if p is invertible, then the template q above is written as $p^{-1}$.

The intersection of the set of strictly doubly $\phi$-astic templates and the set of strictly doubly F-astic templates we call the permutation templates. It is not difficult to show

Proposition 3.44. Let $F_{\pm\infty}$ be a bounded l-group. Then the set of invertible templates from X to X, where $|X| = m$, form a group under the multiplication $\boxed{\vee}$ ($\ovee$), containing 1 as the identity element and having the permutation templates as a subgroup isomorphic to the symmetric group $S_m$ on m letters.

Pre-  or  post-multiplication  of  a  template  t  by  a  permutation  template  p  will  permute 
the  images  or  the  images  ty  of  t,  respectively,  and  these  permutation  templates  play  a 
role  exactly  like  their  counterparts  in  linear  algebra. 

3.3.5. Equivalence of Templates

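Let $F_{\pm\infty}$ be a bounded l-group, and let $t, s \in (F_{\pm\infty}^X)^Y$ be given. We say that t and s are equivalent, written $t \equiv s$, if there exist invertible templates $p \in (F_{\pm\infty}^Y)^Y$ and $q \in (F_{\pm\infty}^X)^X$ such that $p \boxed{\vee} t \boxed{\vee} q = s$ ($p \ovee t \ovee q = s$).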




Now we define elementary templates. An elementary template $p \in (F_{\pm\infty}^X)^X$ over a bounded l-group $F_{\pm\infty}$ with group F is one of the following:

(i) a permutation template

(ii) a diagonal template of the form $\mathrm{diag}(\phi, \dots, \alpha, \dots, \phi)$, where $\alpha \in F$.

Elementary  templates  correspond  to  matrices  which  perform  elementary  operations  on 
matrices  [39].  A  permutation  template 

1.  permutes  the  images  of  t;  or 

2.  permutes  the  images  ty  of  t, 

depending on whether the multiplication is from the left or right, respectively. Diagonal templates of the type listed in (ii) above have the effect of multiplying some image $t'_x$ of t by a finite constant $\alpha$, or multiplying some image $t_y$ of t by a finite constant $\alpha$, depending on whether the multiplication of t is from the left or right, respectively.

Lemma 3.45. Let $F_{\pm\infty}$ be a bounded l-group, and let X and Y be given coordinate sets, $|X| = m$, $|Y| = n$. Then the relation of equivalence is an equivalence relation on $(F_{\pm\infty}^X)^Y$. If $t, s \in (F_{\pm\infty}^X)^Y$, then $t \equiv s$ if and only if there is a sequence of templates $u_0, u_1, \dots, u_k$ such that $u_0 = t$ and $u_k = s$, and $u_p$ is obtained by an elementary operation on $u_{p-1}$, $p = 1,\dots,k$.

Permutation and diagonal templates of this form will play an important role in the discussion on local template decompositions, as well as the following theorem.

Lemma 3.46. Let $F_{\pm\infty}$ be a bounded l-group with group F and let $t \in (F_{\pm\infty}^X)^Y$ be given. If a given image of t' (or t) is F-astic then t is equivalent to a template in which that image of t' (or t) is $\phi$-astic and all other images in t' (or t) are identical with the corresponding image in t' (or t). Hence if t is (row-, column-, or doubly) F-astic then t is equivalent to a template which is (respectively row-, column-, or doubly) $\phi$-astic.


Equivalence and rank. The following are results which show the relation between equivalence and rank.

Proposition 3.47. Let $F_{\pm\infty}$ be a bounded l-group, and let $t, s \in (F_{\pm\infty}^X)^Y$. Then t has $\phi$-astic rank equal to r if and only if the following statement is true for j = r but not for j > r:

t is equivalent to a doubly $\phi$-astic template d which contains a strictly doubly $\phi$-astic template $u \in (F_{\pm\infty}^W)^W$, where $|W| = j$.

Corollary 3.48. Let $F_{\pm\infty}$ be a bounded l-group with group F and let $t, s \in (F_{\pm\infty}^X)^Y$ be equivalent. Then if either t or s has a rank, then so does the other, and the ranks are equal.

3.4. The Eigenproblem in the Image Algebra

Using  the  isomorphism,  we  can  discuss  the  eigenproblem  which  is  presented  in  its 
matrix  form  [39]  in  context  of  the  image  algebra.  In  this  section  we  present  the  eigenproblem 
and  solution  in  image  algebra  notation. 


3.4.1. The Statement in Image Algebra

Unless otherwise stated, we assume that F is a subbelt of either $R$ or $R^{+}$, and let $F_{-\infty}$ and $F_{+\infty}$ have their usual meanings. The coordinate sets X and Y are assumed to be non-empty, finite arrays, with $|X| = m$ and $|Y| = n$.

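Let $\lambda \in F_{\pm\infty}$. Let $\lambda \in (F_{\pm\infty}^X)^X$ be the one-point template defined in the usual way by

$$\lambda_y(x) = \begin{cases} \lambda & \text{if } x = y \\ -\infty & \text{otherwise.} \end{cases}$$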

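Suppose F is a subbelt of R, and $t \in (F_{\pm\infty}^X)^X$. Then the eigenproblem is to find $a \in F_{\pm\infty}^X$ and $\lambda \in F_{\pm\infty}$ such that

$$a \boxed{\vee} t = a \boxed{\vee} \lambda.$$

Similarly for the operation $\ovee$, we need find $a \in F_{\pm\infty}^X$ and $\lambda \in F_{\pm\infty}$ such that

$$a \ovee t = a \ovee \lambda.$$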

For either belt, if such a and $\lambda$ exist, then a is called an eigenimage of t, and $\lambda$ a corresponding eigenvalue. The eigenproblem is called finitely soluble if both a and $\lambda$ are finite.

As mentioned before, all results of this section are applicable for F a subbelt of either $R$ or $R^{+}$. Hence, to avoid stating all results for both belts separately, we will state the results for $\boxed{\vee}$ with the understanding that in all theorems, definitions, etc. in this section of Chapter 3, with the exception of Theorem 3.57, $\boxed{\vee}$ can be replaced by $\ovee$ everywhere and the theorems and results will still hold.

Theorem  3.49.  Let  t  G  (F^)^.  Then  there  exist  s  G  (FJoo)^  that  ifh  is  in  the 
column  space  of  t,  then  b  is  an  eigenimage  of  a  with  corresponding  eigenvalue  <i>.  Here, 
s  =t*  23  t  G  (F^)^.  Hence,  bQt=bQla=b. 

Theorem 3.50. Let $t \in (F_{\pm\infty}^X)^X$. If the eigenproblem for t is finitely soluble, t must be row-F-astic. In particular, if t is row-$\phi$-astic, then the eigenproblem for t is finitely soluble, in which case $\lambda = \phi$.

Let $t \in (F_{\pm\infty}^X)^X$ be definite. We know that A(t) has at least one circuit $\sigma$ such that $p(\sigma) = \phi$. An eigennode of A(t) is any node on such a circuit. Two eigennodes are equivalent if they are both on any one such circuit.

Lemma 3.51. Let $t \in (F_{\pm\infty}^X)^X$ be definite. Then $\Gamma(t)$ is definite, and if j is an eigennode of A(t), then

$$(\Gamma(t))_{x_j}(x_j) = \phi.$$

Conversely, if $(\Gamma(t))_{x_j}(x_j) = \phi$ for some $x_j \in X$, then j is an eigennode of A(t).



Lemma 3.52. Let $t \in (F_{\pm\infty}^X)^X$ be definite. If j is an eigennode of A(t) then

$$a^j \boxed{\vee} t = a^j \boxed{\vee} 1 = a^j,$$

where $a^j$ is the image $[\Gamma(t)]'_{x_j}$.

Thus, images $[\Gamma(t)]'_{x_j}$ where j is an eigennode give us eigenimages for the template t, with corresponding eigenvalue $\phi$. For a given t, the set of all such images are called the fundamental eigenimages for t. Just as in the case for matrices, two fundamental eigenimages $a^j$ and $a^h$ are called equivalent if nodes j and h are equivalent, and otherwise the eigenimages are non-equivalent.

Theorem 3.53. Let $t \in (F_{\pm\infty}^X)^X$ be definite. If $a^j, a^k \in F_{\pm\infty}^X$ are fundamental eigenvectors of t corresponding to equivalent eigennodes j and k, respectively, then

$$a^j = a^k \boxed{\vee} \alpha,$$

where $\alpha \in F$, and $\alpha \in (F_{\pm\infty}^X)^X$ is the one-point template.


If $t \in (F_{\pm\infty}^X)^X$ is definite, let $\{a^{j_1}, \dots, a^{j_k}\}$ be a maximal set of non-equivalent fundamental eigenimages of t. The space $\langle a^{j_1}, \dots, a^{j_k} \rangle$ generated by these eigenimages is called the eigenspace of t.

Theorem 3.54. Let $t \in (F_{\pm\infty}^X)^X$ be given. If the eigenproblem for t is finitely soluble then every finite eigenimage has the same unique corresponding finite eigenvalue $\lambda$. The template $t \boxed{\vee} -\lambda$ is definite, and all finite eigenimages of t lie in the eigenspace of $t \boxed{\vee} -\lambda$. The non-equivalent fundamental eigenimages which generate this space have the property that no one of them is linearly dependent on (any subset of) the others.




The  unique  scalar  in  Theorem  3.54,  when  it  exists,  is  called  the  principal  eigenvalue  of 
t. 

We call a bounded l-group F radicable if for each $a \in F$ and integer $k \ge 1$, there exists a unique $f \in F$ such that $f^k = a$.

Some examples of radicable bounded l-groups are $R_{\pm\infty}$, $Q_{\pm\infty}$, and $R^{+}_{\pm\infty}$. However, $Z_{\pm\infty}$ is not radicable. Choosing a = 12 and k = 5, solving for f in the equation

$$f^5 = 12$$

is just solving for f in (using regular arithmetic)

$$5f = 12$$

which, of course, has no integral solution.

Let F be a radicable bounded l-group, and $t \in (F_{\pm\infty}^X)^X$. Let $\sigma = y_0, y_1, \dots, y_m$ be a circuit in A(t). We define the length of $\sigma$ to be m. For each circuit $\sigma$ in A(t), of length $l(\sigma)$ and having circuit product $p(\sigma)$, we define a circuit mean $\mu(\sigma) \in F$ by

$$[\mu(\sigma)]^{l(\sigma)} = p(\sigma).$$

We also define

$$\lambda(t) = \bigvee \{\, \mu(\sigma) : \sigma \text{ is a simple circuit in } A(t) \,\}.$$

For the template and associated graph A(t) in Figure 8, we have the following computations.


Simple circuit σ    p(σ)    l(σ)    μ(σ)
(1,1)               4       1       4
(2,2)               -1      1       -1
(3,3)               7       1       7
(1,2,1)             5       2       5/2
(2,3,2)             -∞      2       -∞
(3,1,3)             -∞      2       -∞
(1,2,3,1)           1       3       1/3
(3,2,1,3)           -∞      3       -∞

Figure 8. Computation of the Circuit Mean

In this example, $\lambda(t) = 7$.
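The computation of $\lambda(t)$ can be sketched by enumerating simple circuits directly. The template of Figure 8 is not reproduced in this text, so the edge weights below are hypothetical values chosen only to agree with the circuit products listed above; the function names and the dictionary representation of the weighted graph are likewise assumptions made for the example.

```python
from fractions import Fraction
from itertools import permutations

NEG_INF = float("-inf")

# Hypothetical finite edge weights; any missing pair has null weight -inf.
weights = {(1, 1): 4, (2, 2): -1, (3, 3): 7,
           (1, 2): 2, (2, 1): 3, (2, 3): -1, (3, 1): 0}

def simple_circuits(nodes):
    """Yield each simple circuit once, starting at its smallest node."""
    for k in range(1, len(nodes) + 1):
        for perm in permutations(nodes, k):
            if perm[0] == min(perm):
                yield perm

def circuit_product(circuit, w):
    """Sum of edge weights around the circuit; -inf if an edge is missing."""
    total = 0
    for u, v in zip(circuit, circuit[1:] + circuit[:1]):
        if (u, v) not in w:
            return NEG_INF
        total += w[(u, v)]
    return total

def max_circuit_mean(nodes, w):
    """Largest value of p(sigma) / l(sigma) over all simple circuits."""
    best = NEG_INF
    for c in simple_circuits(nodes):
        p = circuit_product(c, w)
        if p != NEG_INF:
            best = max(best, Fraction(p, len(c)))
    return best

print(max_circuit_mean([1, 2, 3], weights))  # 7
```

Here the circuit mean is computed with ordinary division because, in the additive group of reals, the equation $[\mu(\sigma)]^{l(\sigma)} = p(\sigma)$ reads $l(\sigma)\,\mu(\sigma) = p(\sigma)$.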

3.4.3. Solutions to the Eigenproblem

We now present the relation between the parameter $\lambda(t)$ and the principal eigenvalue for t.

Theorem 3.55. Let $F_{\pm\infty}$ be a radicable bounded l-group and let $t \in (F_{\pm\infty}^X)^X$ be given. If the eigenproblem for t is finitely soluble then $\lambda(t)$ is finite, and, in this case, $\lambda(t)$ is the only possible value for the eigenvalue in any finite solution to the eigenproblem for t. That is, $\lambda(t)$ is the principal eigenvalue of t.

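Theorem 3.56. Let $F_{\pm\infty}$ be a radicable sub-bounded l-group of $R_{\pm\infty}$ and let $t \in (F_{\pm\infty}^X)^X$ be given. Then the eigenproblem for t is finitely soluble if and only if $\lambda(t)$ is finite and the template $B(\mathcal{A})$ is doubly F-astic, where $\mathcal{A} = \{ [\Gamma(t \boxed{\vee} -\lambda(t))]'_{x_{j_1}}, [\Gamma(t \boxed{\vee} -\lambda(t))]'_{x_{j_2}}, \dots, [\Gamma(t \boxed{\vee} -\lambda(t))]'_{x_{j_k}} \}$ is a maximal set of non-equivalent fundamental eigenimages for the definite template $t \boxed{\vee} -\lambda(t)$.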


The Computational Task. If $|X|$ is large, and $t \in (F_{\pm\infty}^X)^X$, then to directly evaluate the circuit product for all simple circuits in A(t) is very time consuming. We now state a theorem which makes the task more manageable for the case where the bounded l-group is $R_{\pm\infty}$.

Theorem 3.57. Let $t \in (R_{\pm\infty}^X)^X$ be given. If the eigenproblem for t is finitely soluble, then $\lambda(t)$ is the optimal value of $\lambda$ in the following linear programming problem in the n+1 real variables $\lambda, \xi_1, \dots, \xi_n$:

Minimize $\lambda$ subject to $\lambda + \xi_i - \xi_j \ge t_{x_j}(x_i)$,

where the inequality constraint is taken over all pairs i,j for which $t_{x_j}(x_i)$ is finite.

In Theorem 3.54, we noted the linear independence of the fundamental eigenimages which generate an eigenspace. We are able now to prove a stronger result which has applications to $R_{\pm\infty}$ and $R^{+}_{\pm\infty}$.

Theorem 3.58. Let $F_{\pm\infty}$ be a radicable bounded l-group other than $F_3$, and let $t \in (F_{\pm\infty}^X)^X$ have a finitely soluble eigenproblem. Then the fundamental eigenimages of $-\lambda(t) \boxed{\vee} t$ corresponding to a maximal set of non-equivalent eigennodes in $A[-\lambda(t) \boxed{\vee} t]$ are SLI.

We  now  present  a  result  relating  X(t)  and  Inv. 

Theorem 3.59. Let $F_{\pm\infty}$ be a bounded l-group and $t \in (F_{\pm\infty}^X)^X$ be such that $\lambda(t) < \phi$. Then

$$\mathrm{Inv}(1 \vee t) = 1 \vee t \vee t^2 \vee \cdots \vee t^K$$
for arbitrarily large K. Here, 1 denotes the identity template of $(F_{\pm\infty}^X)^X$.




CHAPTER  4 

GENERALIZATION OF MATHEMATICAL MORPHOLOGY

Up until the mid 1960's, the theoretical tools of quantitative microscopy as applied to image analysis were not based on any cohesive mathematical foundation. It was G. Matheron and J. Serra at the École des Mines de Paris who first pioneered the theory of mathematical morphology as a first attempt to unify the underlying mathematical concepts being used for image analysis in microbiology, petrography, and metallography [16,53,54]. Initially its main use was to describe boolean image processing in the plane, but Sternberg [55] extended the concepts in mathematical morphology to include gray valued images via the cumbersome notion of an umbra. While others including Serra [56,57] also extended morphology to gray valued images in different manners, Sternberg's definitions have been used more regularly, and, in fact, are used by Serra in his book [16].

The basis on which morphological theory lies are the two classical operations of Minkowski addition and Minkowski subtraction from integral geometry [13,14]. For any two sets $A \subset R^n$ and $B \subset R^n$, Minkowski addition and subtraction are defined as

$$A \times B = \bigcup_{b \in B} A_b \quad \text{and} \quad A / B = \bigcap_{b \in B'} A_b,$$

respectively, where $A_b = \{ a + b : a \in A \}$ and $B' = \{ -b : b \in B \}$. We have used the original notation as found in Hadwiger's book [14]. It can be shown that

$$A / B = (A^c \times B')^c,$$

where $A^c$ denotes the complement of A in $R^n$. From these definitions are constructed the two morphological operations of dilation and erosion. As used by Serra and Maragos [16,21],




the dilation of a set $A \subset R^n$ by a structuring element $B \subset R^n$ is denoted by $A \boxplus B'$ and defined by

$$A \boxplus B' = \bigcup_{b \in B'} A_b,$$

while erosion of A by B is

$$A \boxminus B' = \bigcap_{b \in B'} A_b = (A^c \boxplus B')^c.$$

We remark that the actual symbols used in Serra's and Maragos' papers for the dilation and erosion are $\oplus$ and $\ominus$. To avoid confusion with the image algebra operation $\oplus$, we have replaced $\oplus$ and $\ominus$ with $\boxplus$ and $\boxminus$ respectively.

To avoid anomalies without practical interest, the structuring element B is assumed to include the origin $0 \in R^n$, and both A and B are assumed to be compact. Unfortunately, the definitions for dilation and erosion defined by Serra are not the same as the Minkowski operations. In addition, while Maragos uses the same definitions as Serra for dilation and erosion, Maragos [21] uses the identical symbols $\boxplus$ and $\boxminus$ when defining Minkowski addition and subtraction. To add to the confusion, Sternberg defines an erosion and dilation using the same symbols $\boxplus$ and $\boxminus$ which are exactly the Minkowski operations [59]. The following table lists the three definitions. In all cases, $A_b = \{ a + b : a \in A \}$, $B' = \{ -b : b \in B \}$, and $A^c$ denotes the complement of A in $R^n$.




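Table 2.

Minkowski
    addition:            $A \times B = \bigcup_{b \in B} A_b$
    subtraction:         $A / B = \bigcap_{b \in B'} A_b = (A^c \times B')^c$

Serra, Maragos
    dilation of A by B:  $A \boxplus B' = \bigcup_{b \in B'} A_b$
    erosion of A by B:   $A \boxminus B' = \bigcap_{b \in B'} A_b = (A^c \boxplus B')^c$

Sternberg
    dilation of A by B:  $A \boxplus B = \bigcup_{b \in B} A_b$
    erosion of A by B:   $A \boxminus B = \bigcap_{b \in B'} A_b = (A^c \boxplus B')^c$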

Thus  we  see  that  while  Sternberg’s  dilation  of  A  by  B  is  exactly  Minkowski’s  addition 
of  A  and  B,  Serra’s  dilation  of  A  by  B  is  Minkowski’s  addition  of  A  and  B'.  Although  both 
definitions  of  erosion  of  A  by  B  are  equivalent  to  Minkowski’s  subtraction  of  A  and  B,  Serra 
uses  the  symbol  B'  while  Sternberg  uses  simply  B.  For  the  remainder  of  this  chapter  we  will 
use  Sternberg’s  definitions  of  dilation  and  erosion. 

All morphological transformations are combinations of dilations and erosions, such as the opening of A by B, denoted by $A \circ B$,

$$A \circ B = (A \boxminus B) \boxplus B$$
and the closing of A by B, denoted by $A \bullet B$,

$$A \bullet B = (A \boxplus B) \boxminus B.$$
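Sternberg's set definitions, and the opening and closing built from them, can be sketched with finite sets of integer points. This is an illustrative sketch only: it restricts attention to finite subsets of the integer plane, represents points as tuples, and the function names and sample sets are hypothetical.

```python
def dilate(A, B):
    """Minkowski addition: union over b in B of the translate A_b."""
    return {(a[0] + b[0], a[1] + b[1]) for a in A for b in B}

def erode(A, B):
    """Intersection over b in B' of A_b, i.e. all y with y + b in A for every b in B."""
    b0 = next(iter(B))
    # Any y in the erosion must satisfy y + b0 in A, which bounds the search.
    candidates = {(a[0] - b0[0], a[1] - b0[1]) for a in A}
    return {y for y in candidates
            if all((y[0] + b[0], y[1] + b[1]) in A for b in B)}

def opening(A, B):
    return dilate(erode(A, B), B)

def closing(A, B):
    return erode(dilate(A, B), B)

# Hypothetical data: a 3 x 3 square plus one isolated point.
square = {(i, j) for i in range(3) for j in range(3)}
A = square | {(5, 5)}
B = {(0, 0), (1, 0)}

print(opening(A, B) == square)  # True: the isolated point is removed
print(closing(A, B) == A)       # True for this particular A and B
```

For these sets the opening is contained in A and A is contained in the closing, as expected of the two transformations.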

However, a more general image transform in mathematical morphology is the Hit or Miss transform [55,54]. Since an erosion and hence a dilation is a special case of the Hit or Miss transform, this transform is often viewed as the universal morphological transformation upon which the theory of mathematical morphology is based. Let B = (D,E) be a pair of structuring elements. Then the Hit or Miss transform of the set A is given by the expression

$$A \circledast B = \{ a : D_a \subset A,\ E_a \subset A^c \}.$$

For practical applications it is assumed that $D \cap E = \emptyset$. The erosion of A by D is obtained by simply letting $E = \emptyset$, in which case we have $A \circledast B = A \boxminus D$.
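A direct reading of the Hit or Miss definition can be sketched as follows. The sketch assumes finite point sets in the integer plane and tests candidate points only inside a finite search window; the window, the function name and the sample data are hypothetical and not part of the original definition.

```python
def hit_or_miss(A, D, E, window):
    """Points a in window with D_a inside A and E_a inside the complement of A."""
    result = set()
    for a in window:
        hits = all((a[0] + d[0], a[1] + d[1]) in A for d in D)
        misses = all((a[0] + e[0], a[1] + e[1]) not in A for e in E)
        if hits and misses:
            result.add(a)
    return result

# Hypothetical data: detect points of A that have no 4-neighbour in A.
A = {(0, 0), (1, 0), (5, 5)}
D = {(0, 0)}
E = {(1, 0), (-1, 0), (0, 1), (0, -1)}
window = {(i, j) for i in range(-1, 8) for j in range(-1, 8)}

print(hit_or_miss(A, D, E, window))                      # {(5, 5)}
print(hit_or_miss(A, {(0, 0), (1, 0)}, set(), window))   # {(0, 0)}
```

The second call takes E empty, so it returns the erosion of A by D restricted to the window.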

While there have been several extensions of the boolean dilation to the gray level case, Sternberg's formulae for computing the gray value erosion and dilation are the most straightforward, although the underlying theory introduces the somewhat extraneous concept of an umbra. Let $f: R^n \to R$ be a function. Then the umbra of f, denoted by $U(f)$, is the set $U(f) \subset R^{n+1}$ defined by

$$U(f) = \{ p = (x,z) \in R^{n+1} : z \le f(x) \}.$$

Again, the notion of an unbounded set is exhibited in this definition, for in general the value z can approach $-\infty$. Since $U(f) \subset R^{n+1}$, the dilation of two functions f and g is defined through the dilation of their umbras,

$$U(f \boxplus g) = U(f) \boxplus U(g),$$

and similarly the erosion of f by g,

$$U(f \boxminus g) = U(f) \boxminus U(g).$$

Any function $d: R^n \to R$ has the property that $d(x) = \max\{ z \in R : (x,z) \in U(d) \}$, and thus the set $U(f \boxplus g)$ well-defines the function $f \boxplus g$. However, when actually calculating the new functions $d = f \boxplus g$ and $e = f \boxminus g$, Sternberg gives the following formulae for the two-dimensional dilation and erosion, respectively:

$$d(x,y) = \max_{(i,j) \in B} [\, f(x-i,\, y-j) + g(i,j) \,] \qquad (4\text{-}1)$$

$$e(x,y) = \min_{(i,j) \in B'} [\, f(x-i,\, y-j) - g(-i,-j) \,] \qquad (4\text{-}2)$$

The function f represents the image, and g represents the structuring element. Both f and g are assumed to have finite support, with values of $-\infty$ outside. Also, in general the support of g is much smaller than the coordinate set X, and $g(0) \ne -\infty$. So in practice, the notion of an umbra need not be introduced at all.
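The two gray value formulas can be written out directly, without any reference to umbras. The sketch below assumes that f and g are stored as dictionaries from integer points to finite values, with every point outside the dictionary understood to carry the value $-\infty$; that representation, the function names and the sample data are assumptions made for illustration.

```python
NEG_INF = float("-inf")

def gray_dilate(f, g):
    """d(p) = max over b in supp(g) of f(p - b) + g(b)."""
    d = {}
    for (px, py), fv in f.items():
        for (bx, by), gv in g.items():
            key = (px + bx, py + by)
            d[key] = max(d.get(key, NEG_INF), fv + gv)
    return d

def gray_erode(f, g):
    """e(p) = min over b in supp(g) of f(p + b) - g(b); -inf results are dropped."""
    e = {}
    for (px, py) in f:
        value = min(f.get((px + bx, py + by), NEG_INF) - gv
                    for (bx, by), gv in g.items())
        if value != NEG_INF:
            e[(px, py)] = value
    return e

# Hypothetical image and structuring element on a single row.
f = {(0, 0): 1, (1, 0): 3, (2, 0): 2}
g = {(0, 0): 0, (1, 0): 1}

print(gray_dilate(f, g))  # {(0, 0): 1, (1, 0): 3, (2, 0): 4, (3, 0): 3}
print(gray_erode(f, g))   # {(0, 0): 1, (1, 0): 1}
```

In `gray_erode` the index of the erosion formula has been substituted as $(i,j) = -b$ with b in the support of g, which turns $f(x-i, y-j) - g(-i,-j)$ into $f(p + b) - g(b)$. Only points of the support of f are visited, which is sufficient when $g(0) \ne -\infty$.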

Note that when applying these transforms to real data, we cannot simply substitute an image a for the set A, as the expression $A^c$ becomes meaningless to a computer. What is actually assumed is that A corresponds to the black pixels in a boolean image a, that is, given $A \subset R^n$, a coordinate set $X \subset R^n$ is chosen and a two-valued image a on X is found, where 1 and 0 represent the two values:

$$a(x) = \begin{cases} 1 & \text{if } x \in A \cap X \\ 0 & \text{otherwise.} \end{cases}$$

For the two-dimensional gray value case, Sternberg's formulas (4-1) and (4-2) are easily written in computer code, and this is, in fact, close to the image algebra definition for dilation.

In  short,  when  implementing  a  problem  which  is  posed  in  morphological  terms,  the  solution 
must  be  reposed  in  a  setting  which  more  closely  represents  the  computing  environment.  On 
the  other  hand,  it  has  been  established  that  the  image  algebra  comes  very  close  to  ideally 
modeling  a  large  number  of  important  image  processing  problems,  such  as  mapping  of 
transforms  to  sequential  and  parallel  architectures  [45]  and  this  dissertation,  and  expressing 
sequential  algorithms  in  a  parallel  manner  [60]. 

The next part of this chapter is devoted to establishing an isomorphism between the morphological algebra and the image algebra. We will show that performing a dilation is equivalent to calculating

$$a \boxed{\vee} t$$

for the appropriate a and t, and performing an erosion is equivalent to calculating

$$a \boxed{\wedge} t^*$$

for appropriate a and t.

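Let A,B be finite subsets of $Z^n$, where B is a structuring element. Let $X = Z^n$ or choose $X \subset Z^n$ to be a finite set such that $A \boxplus B \subset X$. Let $F_4$ denote the value set $\{ -\infty, 0, 1, +\infty \}$. Define $\xi : 2^{Z^n} \to F_4^X$ by $\xi(A) = a$ where

$$a(x) = \begin{cases} 1 & \text{if } x \in A \\ 0 & \text{otherwise.} \end{cases}$$

Let $\mathcal{B} = \{ B \subset Z^n : |B| < \infty \text{ and } 0 \in B \}$, and let $\mathcal{T}$ be the set of all $F_4$ valued invariant templates from X to X such that $y \in S_{-\infty}(t_y)$. Define $\eta : \mathcal{B} \to \mathcal{T}$ by $\eta(B) = t$ where

$$t_y(x) = \begin{cases} 0 & \text{if } x \in B'_y \\ -\infty & \text{otherwise.} \end{cases}$$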

Lemma 4.1. Let $\xi$ and $\eta$ be as above. Let $A \subset Z^n$, and $B \in \mathcal{B}$ a structuring element. Then

$$\xi(A \boxplus B) = \xi(A) \boxed{\vee} \eta(B).$$

Proof: Choose X large enough such that $A \boxplus B \subset X$. Let $D = A \boxplus B$ and $f = \xi(A) \boxed{\vee} \eta(B)$. We must show that $y \in D$ if and only if $f(y) = 1$. To this end, we note that

$y \in A \boxplus B \iff y \in A_b$ for some $b \in B \iff y = x + b$ for $x \in A$, $b \in B$
$\iff x = (-b) + y$, $-b \in B'$, $x \in A \iff x \in A$ and $x = (-b) + y \in B'_y$
$\iff a(x) = 1$ and $t_y(x) = 0 \iff \bigvee_{x \in X} a(x) + t_y(x) = f(y) = 1.$

Q.E.D.
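The statement of Lemma 4.1 can be checked numerically on a small example. The sketch below assumes a finite coordinate set X large enough to contain the dilation, stores the image as a dictionary on X, and stores each template image sparsely (only the points of weight 0); these storage choices, the names `xi`, `eta`, `box_max` and the sample sets are illustrative assumptions.

```python
NEG_INF = float("-inf")

def dilate(A, B):
    """Minkowski addition of two finite point sets."""
    return {(a[0] + b[0], a[1] + b[1]) for a in A for b in B}

def xi(A, X):
    """Image a with a(x) = 1 on A and 0 elsewhere on X."""
    return {x: (1 if x in A else 0) for x in X}

def eta(B, X):
    """Template t with t_y(x) = 0 for x in B'_y = {y - b : b in B}; -inf is implicit."""
    return {y: {(y[0] - b[0], y[1] - b[1]): 0 for b in B} for y in X}

def box_max(a, t):
    """(a boxed-max t)(y) = max over x of a(x) + t_y(x)."""
    return {y: max((a[x] + w for x, w in t_y.items() if x in a), default=NEG_INF)
            for y, t_y in t.items()}

X = [(i, j) for i in range(5) for j in range(5)]
A = {(1, 1), (2, 1)}
B = {(0, 0), (0, 1)}   # contains the origin, as the lemma assumes

lhs = xi(dilate(A, B), X)
rhs = box_max(xi(A, X), eta(B, X))
print(lhs == rhs)  # True
```

Points of $B'_y$ that fall outside X are simply skipped, which is harmless here because X was chosen to contain the whole dilation.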

We call a the image corresponding to A, and t the template corresponding to the structuring element B.


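The next lemma shows the correspondence between the $\boxed{\wedge}$ operation and erosion.

Lemma 4.2. Let $\xi$ and $\eta$ be as above. Let $A \subset Z^n$, and $B \subset Z^n$ a structuring element. Then

$$\xi(A \boxminus B) = \xi(A) \boxed{\wedge} [\eta(B)]^*.$$

Proof: Let $D = A \boxminus B$ and let $c = \xi(A) \boxed{\wedge} [\eta(B)]^*$. We must show that $y \in D$ if and only if $c(y) = 1$.

$y \in D \iff y \in A_p\ \forall\, p \in B' \iff y = x_p + p\ \forall\, p \in B'$,
where the choice of $x_p \in A$ depends on p. Let $a = \xi(A)$ and $t = \eta(B)$. Then $c = a \boxed{\wedge} t^*$ and

$$c(y) = \bigwedge_{x \in X} a(x) +' t^*_y(x) = \bigwedge_{x \in S_{+\infty}(t^*_y)} a(x) + t^*_y(x).$$

We have

$$t^*_y(x) = [\, t_x(y) \,]^* = \begin{cases} 0 & \text{if } y \in B'_x \\ +\infty & \text{otherwise.} \end{cases}$$

We claim that $S_{+\infty}(t^*_y) = B_y$. To show this, note that

$x \in S_{+\infty}(t^*_y) \iff t^*_y(x) = 0 = t_x(y) \iff y \in B'_x \iff y = p + x$ for some $p \in B' \iff x = b + y$ for some $b \in B \iff x \in B_y$.

Thus,

$y \in D \iff y = x_p + p\ \forall\, p \in B' \iff x_b = b + y\ \forall\, b \in B$, for some $x_b \in A$
$\iff b + y = x \in A\ \forall\, b \in B \iff B_y = S_{+\infty}(t^*_y) \subset A$ (by definition of $B_y$)
$\iff a(x) = 1\ \forall\, x \in B_y \subset A$ and $t^*_y(x) = 0\ \forall\, x \in B_y = S_{+\infty}(t^*_y)$
$\iff \bigwedge_{x \in S_{+\infty}(t^*_y)} a(x) + t^*_y(x) = 1 = c(y).$

Q.E.D.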
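The erosion identity of Lemma 4.2 can be checked in the same spirit. The sketch below uses the fact, shown in the proof, that the conjugate template has weight 0 exactly on $B_y$, so the minimum runs over the points $y + b$ with b in B. It assumes a finite coordinate set X and treats points outside X as background (value 0); that boundary convention, the function names and the sample sets are assumptions for illustration.

```python
def erode(A, B, X):
    """Set erosion restricted to X: all y with y + b in A for every b in B."""
    return {y for y in X
            if all((y[0] + b[0], y[1] + b[1]) in A for b in B)}

def xi(A, X):
    """Image a with a(x) = 1 on A and 0 elsewhere on X."""
    return {x: (1 if x in A else 0) for x in X}

def box_min_conjugate(a, B):
    """(a boxed-min t*)(y) = min over x in B_y of a(x) + 0."""
    out = {}
    for y in a:
        support = [(y[0] + b[0], y[1] + b[1]) for b in B]   # B_y
        out[y] = min(a.get(x, 0) for x in support)          # outside X counts as 0
    return out

X = [(i, j) for i in range(5) for j in range(5)]
A = {(1, 1), (2, 1), (3, 1), (1, 2), (2, 2), (3, 2)}
B = {(0, 0), (1, 0)}

lhs = xi(erode(A, B, X), X)
rhs = box_min_conjugate(xi(A, X), B)
print(lhs == rhs)  # True
```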


Lemmas 4.1 and 4.2 include not only boolean but gray level dilation and erosion. However, we now show explicitly that Sternberg's formulas (4-1) and (4-2) hold in the two dimensional case. Let $f, g : Z^2 \to R_{-\infty}$ be two extended real valued functions with finite support, where f represents the image and g the structuring element. Choose X to be either $Z^2$ or a finite subset containing the support of $f \boxplus g$. Then $\xi : R_{-\infty}^{Z^2} \to R_{-\infty}^X$ is the identity function restricted to X, that is, a(x) = f(x), where $a = \xi(f)$. Let B denote the support of g, $B = \{ x \in X : g(x) \ne -\infty \}$. Let $\mathcal{T}$ denote the set of all $R_{-\infty}$ valued templates from X to X such that $y \in S_{-\infty}(t_y)$ for all $y \in X$. Define $\eta : R_{-\infty}^X \to \mathcal{T}$ by $\eta(g) = t$ where

$$t_y(x) = \begin{cases} g(y - x) & \text{if } x \in B'_y \\ -\infty & \text{otherwise.} \end{cases}$$

Note that if $x \in B'_y$, then x = p + y for some $p \in B'$, which implies that $g(-p) = g(y - x)$ is well-defined. Also, $S_{-\infty}(t_y) = B'_y$. The formal relation between the gray-scale morphological operations and $\boxed{\vee}$ and $\boxed{\wedge}$ are shown in the next two theorems.

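Theorem 4.3. Let f, g, B, and X be as above. Then

$$\xi(f \boxplus g) = \xi(f) \boxed{\vee} \eta(g).$$

Proof:

At location $(x,y) \in X \subset Z^2$,

$$(a \boxed{\vee} t)(x,y) = \bigvee_{z \in X} a(z) + t_{(x,y)}(z) = \bigvee_{(i,j) \in S_{-\infty}(t_{(x,y)})} a(i,j) + t_{(x,y)}(i,j),$$

while at (x,y), $f \boxplus g = d$ has value

$$d(x,y) = \max_{(i,j) \in B} [\, f(x-i, y-j) + g(i,j) \,] = \max_{(x-i,\, y-j) \in B} [\, f(i,j) + g(x-i, y-j) \,].$$

Given $(x-i, y-j) = (-p_1, -p_2) \in B$, we have $(i,j) = (p_1, p_2) + (x,y) \in B'_{(x,y)}$, and hence,
$f(i,j) + g(x-i, y-j) = a(i,j) + t_{(x,y)}(i,j)\ \forall\, (i,j) \in B'_{(x,y)}$.

Therefore,

$$d(x,y) = \max_{(x-i,\, y-j) \in B} [\, f(i,j) + g(x-i, y-j) \,] = \max_{(i,j) \in B'_{(x,y)}} [\, f(i,j) + g(x-i, y-j) \,]$$

$$= \bigvee_{(i,j) \in S_{-\infty}(t_{(x,y)})} a(i,j) + t_{(x,y)}(i,j) = (a \boxed{\vee} t)(x,y).$$

Q.E.D.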

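If the template t corresponding to the structuring element g has form

$$t_y(x) = \begin{cases} g(y - x) & \text{if } x \in B'_y \\ -\infty & \text{otherwise,} \end{cases}$$

then the template $t^*$ has form

$$t^*_y(x) = \begin{cases} -g(x - y) & \text{if } x \in B_y \\ +\infty & \text{otherwise.} \end{cases}$$

Since $t^*_y(x) \in R$ if and only if $x - y \in B$ if and only if $x \in B_y$, $S_{+\infty}(t^*_y) = B_y$.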

Theorem  4.4.  Let  f,g,  B,  and  X  be  aa  in  Theorem  4.3.  Then 

?(fBg)  =  e(OSKs)r- 

Proof: 

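At location $(x,y) \in X \subset Z^2$,

$$(a \,\boxed{\wedge}\, t^*)(x,y) = \bigwedge_{z \in X} a(z) +' t^*_{(x,y)}(z) = \bigwedge_{(i,j) \in S_{+\infty}(t^*_{(x,y)})} a(i,j) + t^*_{(x,y)}(i,j),$$

while at (x,y), $f \boxminus g = e$ has value

$$e(x,y) = \min_{(i,j) \in B'} [\, f(x-i,y-j) - g(-i,-j) \,] = \min_{(-x+i,-y+j) \in B} [\, f(i,j) - g(-x+i,-y+j) \,].$$

Given $(-x+i,-y+j) = (b_1,b_2) \in B$, we have $(i,j) = (b_1,b_2) + (x,y) \in B_{(x,y)}$ and, hence,

$$f(i,j) - g(-x+i,-y+j) = a(i,j) + t^*_{(x,y)}(i,j) \quad \forall\, (i,j) \in B_{(x,y)}.$$

Therefore,

$$e(x,y) = \min_{(-x+i,-y+j) \in B} [\, f(i,j) - g(-x+i,-y+j) \,] = \min_{(i,j) \in B_{(x,y)}} [\, f(i,j) - g(-x+i,-y+j) \,] = \bigwedge_{(i,j) \in S_{+\infty}(t^*_{(x,y)})} a(i,j) + t^*_{(x,y)}(i,j) = (a \,\boxed{\wedge}\, t^*)(x,y).$$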


Q.E.D. 
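A numerical check of Theorem 4.4 can be sketched in the same spirit. The assumptions are again a finite one-dimensional coordinate set and a structuring function stored as a dictionary; `erode`, `conjugate_template`, and `additive_min` are hypothetical names used only for this illustration.

```python
POS_INF = float("inf")

def erode(f, g):
    """Gray-scale erosion e(y) = min over b in B of f(y + b) - g(b), truncated to X."""
    n = len(f)
    return [min((f[y + b] - w for b, w in g.items() if 0 <= y + b < n),
                default=POS_INF)
            for y in range(n)]

def conjugate_template(g, n):
    """Template t* with t*[y][x] = -g(x - y) when x - y lies in B, and +inf otherwise."""
    return [[-g[x - y] if (x - y) in g else POS_INF for x in range(n)]
            for y in range(n)]

def additive_min(a, t_star):
    """Additive minimum: value at y is the min over x of a(x) + t*_y(x)."""
    return [min(a[x] + row[x] for x in range(len(a))) for row in t_star]

f = [3, 1, 4, 1, 5, 9, 2, 6]          # image values (assumed example data)
g = {-1: 0, 0: 2, 1: -1}              # structuring function with support B = {-1, 0, 1}

e_direct = erode(f, g)
e_template = additive_min(f, conjugate_template(g, len(f)))
```

The two lists agree, mirroring the statement that erosion by g corresponds to the additive minimum with the conjugate template.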




It is easily ascertained that each of $\xi$ and $\eta$ is one-one and onto in both the boolean and gray level cases. The functions $\xi$ and $\eta$ therefore preserve the morphological operations, and in fact Theorems 4.1 through 4.4 show that mathematical morphology is embedded into the image algebra. We condense the results in the following two expressions.

$a \,\boxed{\vee}\, t$ corresponds to the dilation of f by g, $f \boxplus g$

$a \,\boxed{\wedge}\, t^*$ corresponds to the erosion of f by g, $f \boxminus g$

The operations $\ovee$ and $\owedge$ can also be used to express a boolean dilation or erosion. Given $A \subset R^n$, the image $a \in \{0,1\}^X$ corresponding to A is defined as before, while for a structuring element $B \subset R^n$, the template t corresponding to B is defined by

$$t_y(x) = \begin{cases} 1 & \text{if } x \in B'_y \\ 0 & \text{otherwise.} \end{cases}$$

Then $a \ovee t$ corresponds to the dilation of A by B, while $a \owedge t^*$ corresponds to the erosion of A by B. Here, of course,

$$t^*_y(x) = \begin{cases} 1 & \text{if } y \in B'_x \\ +\infty & \text{otherwise.} \end{cases}$$

Thus, the value set $\{ -\infty, 0, 1, +\infty \}$ along with the operation $\boxed{\vee}$ can be used to express a boolean dilation, where both the input image a and the output $a \,\boxed{\vee}\, t$ are $\{0,1\}$ valued images. Similarly, the three element blog $F_3 = \{ 0, 1, +\infty \}$ along with the operation $\ovee$ can be used to express a boolean dilation, where both a and $a \ovee t$ are $\{0,1\}$ valued images.

In the boolean case it is a simple exercise to show that the Hit or Miss transform can be expressed as

$$A \circledast B = (A \boxminus D) \cap (A^c \boxminus E). \qquad (4\text{-}3)$$

If we let t be the boolean template corresponding to D and s the boolean template corresponding to E, we obtain the equivalent image algebra expression

$$(a \,\boxed{\wedge}\, t^*) * (\chi_0(a) \,\boxed{\wedge}\, s^*), \qquad (4\text{-}4)$$

or, using the bounded l-group $F_3 = \{0, 1, +\infty\}$,

$$(a \owedge t) * (\chi_0(a) \owedge s). \qquad (4\text{-}5)$$

However, there is an even simpler image algebra formulation of the Hit or Miss transform which does not employ the notions of minimum or erosion. We state this in the next lemma, using equation (4-5) as the representation of the Hit or Miss transform. Before doing so, however, we introduce the concept of a census template. Let $S \subset X \subset R^n$, where X is a finite array. Suppose every vector $s = (s_1, \ldots, s_n) \in S$ is assigned a value of 0 or 1, and we wish to determine if the pattern of 0's and 1's within S matches some predetermined pattern we want to identify. One can uniquely identify every possible combination of 0 and 1 values within S by attaching a unique number to each combination. This is done by using an invariant template whose support is the set S and in which each weight $t_y(x)$, $x \in S$, is a distinct power of a prime number. When applied via the operation $\oplus$ to a boolean image, each distinct pattern of 0's and 1's in S will result in a unique number. While this theory is general enough to identify patterns in n-dimensional space, it is primarily used for 2-dimensional image processing. The following is a 2-dimensional census template.


     16     8     4
     32     1     2
     64   128   256

Figure 9. An Example of a Census Template.

This template was used as part of an algorithm to determine the chain code of a boolean image [60].

Lemma 4.5. Let t be the boolean template corresponding to D and s the boolean template corresponding to E in the Hit or Miss transform (4-5). Suppose $S_{+\infty}(t_y) = \{x_1, \ldots, x_k\}$ and $S_{+\infty}(s_y) = \{x_{k+1}, \ldots, x_n\}$, where the $x_i$ are distinct, i = 1,...,n. Then

$$\chi_m(a \oplus r), \quad \text{where } m = \sum_{i=1}^{k} 2^{i-1},$$

is equivalent to computing the Hit or Miss transform, and the n-point census template r is defined by

$$r_y(x) = \begin{cases} 2^{i-1} & \text{if } x = x_i \\ 0 & \text{if } x \neq x_i \text{ for any } i = 1,\ldots,n. \end{cases}$$

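Proof: Note that $a \in \{0,1\}^X$ and $\chi_m(a \oplus r) \in \{0,1\}^X$ also. Let $b = (a \owedge t) * (\chi_0(a) \owedge s)$, and let $c = \chi_0(a)$. Then at $y \in X$, expression (4-5) has value

$$b(y) = \Big( \bigwedge_{x \in S_{+\infty}(t_y)} a(x) * t_y(x) \Big) * \Big( \bigwedge_{x \in S_{+\infty}(s_y)} c(x) * s_y(x) \Big) = \Big( \bigwedge_{i=1}^{k} a(x_i) * t_y(x_i) \Big) * \Big( \bigwedge_{i=k+1}^{n} c(x_i) * s_y(x_i) \Big).$$

The pixel b(y) will have value 1 if and only if each of the expressions

$$\bigwedge_{i=1}^{k} a(x_i) * t_y(x_i) \qquad (4\text{-}6)$$

$$\bigwedge_{i=k+1}^{n} c(x_i) * s_y(x_i) \qquad (4\text{-}7)$$

has value 1. Note that the only other possible value that (4-6) or (4-7) can assume is 0. Now,

$$\bigwedge_{i=1}^{k} a(x_i) * t_y(x_i) = 1 \iff a(x_i) * t_y(x_i) = 1 \;\; \forall\, i = 1,\ldots,k \iff a(x_i) = 1 \text{ and } t_y(x_i) = 1 \;\; \forall\, i = 1,\ldots,k.$$

Also,

$$\bigwedge_{i=k+1}^{n} c(x_i) * s_y(x_i) = 1 \iff c(x_i) * s_y(x_i) = 1 \;\; \forall\, i = k+1,\ldots,n \iff c(x_i) = 1 \text{ and } s_y(x_i) = 1 \;\; \forall\, i = k+1,\ldots,n.$$

Thus, expression (4-5) will have value 1 if and only if $a(x_i) = 1$ for all i = 1,...,k and $c(x_i) = 1$ for all i = k+1,...,n. But $c(x_i) = 1$ if and only if $a(x_i) = 0$, and this is true for all i = k+1,...,n. Therefore (4-5) will have value 1 if and only if $a(x_i) = 1$ for all i = 1,...,k and $a(x_i) = 0$ for all i = k+1,...,n. The image $\chi_m(a \oplus r)$ will assume a non-zero value only when $a \oplus r$ has value m. This happens if and only if $a(x_i) = 1$ for all i = 1,...,k and $a(x_i) = 0$ for all i = k+1,...,n, as

$$(a \oplus r)(y) = \sum_{x \in S(r_y)} a(x) \cdot r_y(x) = \sum_{i=1}^{n} a(x_i) \cdot r_y(x_i) = \sum_{i=1}^{n} a(x_i) \cdot 2^{i-1},$$

which equals $m = \sum_{i=1}^{k} 2^{i-1}$ if and only if $a(x_i) = 1$ for i = 1,...,k and $a(x_i) = 0$ for i = k+1,...,n, and equals an integer different from m otherwise.

Here we use the fact that $S(r_y) = S_{+\infty}(t_y) \cup S_{+\infty}(s_y)$, and that $S_{+\infty}(t_y) \cap S_{+\infty}(s_y) = \emptyset$ for all $y \in X$ (as $D \cap E = \emptyset$). Therefore we see that (4-5) will have value 1 at location y if and only if $\chi_m(a \oplus r)$ has value 1 at location y.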

Q.E.D. 
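The equivalence in Lemma 4.5 can be illustrated with a short sketch. The assumptions here are not from the text: the image is one-dimensional, D and E are given as lists of offsets relative to the target pixel, and pixels outside the array are treated as 0. The helper names are hypothetical.

```python
def pixel(a, x):
    """Pixel value, with locations outside the array treated as 0 (an assumption)."""
    return a[x] if 0 <= x < len(a) else 0

def hit_or_miss(a, D, E):
    """Direct test: 1 at y when every D-offset pixel is 1 and every E-offset pixel is 0."""
    return [int(all(pixel(a, y + d) == 1 for d in D) and
                all(pixel(a, y + e) == 0 for e in E))
            for y in range(len(a))]

def census(a, offsets):
    """Census value at y: sum over i of a(x_i) * 2**(i-1), with x_i = y + offsets[i-1]."""
    return [sum(pixel(a, y + o) << i for i, o in enumerate(offsets))
            for y in range(len(a))]

def hit_or_miss_by_census(a, D, E):
    """Threshold the census image at m = 2**0 + ... + 2**(k-1), where k = len(D)."""
    m = (1 << len(D)) - 1
    return [int(c == m) for c in census(a, list(D) + list(E))]

a = [0, 1, 1, 0, 1, 1, 1, 0, 0, 1]    # boolean image (assumed example data)
D = [0, 1]                            # offsets required to be 1
E = [-1, 2]                           # offsets required to be 0

direct = hit_or_miss(a, D, E)
via_census = hit_or_miss_by_census(a, D, E)
```

The census route needs one weighted sum and one comparison per pixel, with no minimum or erosion, which is the point the lemma makes.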


We have shown that the subalgebra $(R^X_{\pm\infty}, \mathcal{I}, \vee, \boxed{\vee}, \wedge, \boxed{\wedge})$ of the full image algebra is isomorphic to the morphological algebra as described by Serra and Sternberg. Since invariant templates with the target pixel included in their support (the set $\mathcal{I}$) are a special type of template, it is clear that $(R^X_{\pm\infty}, (R^X_{\pm\infty})^X, \vee, \boxed{\vee}, \wedge, \boxed{\wedge})$ is a much larger algebra than the morphological algebra. Templates generalize the concept of a structuring element. Templates can vary in size, shape, and weights from point to point, and they are able to express a more general mapping, taking an image with possibly values of $+\infty$ in m-dimensional space to an image in n-dimensional space if we replace $(R^X_{\pm\infty})^X$ by $(R^X_{\pm\infty})^Y$, where $X \subset R^m$ and $Y \subset R^n$. Thus, an expression of form $a \,\boxed{\vee}\, t$ can represent a far more complex process than a simple dilation. For example, let a denote the input image shown in Figure 10(a) and define t by


$$t_{(x,y)}(i,j) = \begin{cases} 0 & \text{if } (x,y) = (i,j), \text{ or if } (x,y) \in S_1 \cup S_2 \text{ and } (0,0) = (i,j) \\ -\infty & \text{otherwise} \end{cases}$$


where  =  {  (x,y) :  0.9  < 


<  1.1  }  and  So  =  {  (x,y) ;  0.9  <  ^  <  M  }.  c 

d“  q" 


= 30, q = 15, $p^2 = q^2 + c^2$, $d^2 = c^2 - q^2$. In this case $a \,\boxed{\vee}\, t$ is obviously not a simple dilation. It is not at all clear whether this transformation can be expressed in terms of dilations and erosions starting with the input image a, or, if this is indeed possible, whether the resulting expression would be transparent enough to justify the effort. The input and output images are shown in the next two figures.


Figure 10. Example of a Non-morphological Transformation.

(a) Input Image a; (b) Image $a \,\boxed{\vee}\, t$.

As a last remark, we state that a structuring element corresponds to a square matrix that is block Toeplitz with Toeplitz blocks and that has finite elements on the diagonal. A decomposition technique for a class of invariant templates called rectangular templates is discussed in Chapter 5, where the method is presented in matrix as well as image algebra notation.




CHAPTER 5

TRANSFORM DECOMPOSITION

5.1. New Matrix Decomposition Results

In this section, we state new matrix results which do not appear in the book Minimax Algebra. We will have particular use for most of the material presented here, as it will be used to give necessary and sufficient conditions for local decomposition of lattice transforms. The isomorphisms are used to map matrix algebra techniques to the image algebra.

As mentioned in the introduction, the use of parallel processors in computing image processing transforms is on the increase. Most transforms cannot be applied directly to a parallel architecture. Instead, a particular transform must usually be mapped to a specific architecture; that is, the limitations of the machinery upon which the transform is to be implemented must be represented in the mathematical expression of the transform. For example, in the case of the neighborhood array processors, this involves decomposing the transform into a sequence of factors where each factor is directly implementable on the architecture. To give the general idea of this approach, we represent the set of processors by a rectangular array X, $X = \{ (i,j) : 0 \le i < m,\; 0 \le j < n \}$, and if the neighborhood has a von Neumann or Moore configuration, then communication links between each processor and the neighboring processors to which it is connected (the local network) are represented by one of the diagrams in Figure 1.

Here, the processor x can distribute information to and receive information from its four (or eight) nearest neighbors. A transform is represented by a template $t \in (F^X_{\pm\infty})^X$. If we can write t as a product of templates,

$$t = t^1 \,\boxed{\vee}\, t^2 \,\boxed{\vee}\, \cdots \,\boxed{\vee}\, t^k,$$

where each $t^i$ has its support $S_{-\infty}(t^i_y)$ as a subset of the local network configuration (in this example, the von Neumann or Moore configuration) for all $y \in X$, then we say that t has a local decomposition with respect to the network. Thus, the important consideration in determining decompositions is the underlying network of communication between processors. The network can be modeled by a graph or digraph, with nodes as the processors and edges or directed edges as the communication links between processors. The results on decomposition that follow will enable us to determine necessary and sufficient conditions on the graph of the network to guarantee the existence of a local transform decomposition under the operation of $\boxed{\vee}$ or $\ovee$.

In this chapter, we assume that $F_{\pm\infty}$ is a sub-bounded l-group of the additive bounded l-group $R_{\pm\infty}$ or of its multiplicative counterpart, where F is the group of the bounded l-group $F_{\pm\infty}$, and $-\infty$, $+\infty$ have their usual meaning in the context of the respective bounded l-groups. Some sub-bounded l-groups are $R_{\pm\infty}$, $Q_{\pm\infty}$, and $Z_{\pm\infty}$. Further, we assume that any matrix described in these sections, unless noted otherwise, will assume no values of $+\infty$.

5.1.1. Preliminaries

We  first  give  the  results  in  matrix  notation  in  section  5.1,  and  then  use  the  isomorphism 
to  map  the  results  to  image  algebra  notation  in  section  5.2. 

Properties of the transpose and conjugate. Recall the notation $t'$, which denotes the transpose of the matrix t, and that $(t')' = t$.

Theorem 5.1. Let $s \in \mathcal{M}_{mp}$, $t \in \mathcal{M}_{pn}$ be given. Then $(s \times t)' = t' \times s'$, and $(s \times' t)' = t' \times' s'$.


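Proof: We give the proof for $(s \times t)' = t' \times s'$. The case where $\times'$ replaces $\times$ is done in a similar way. First, note that $(s \times t)' \in \mathcal{M}_{nm}$ and $t' \times s' \in \mathcal{M}_{nm}$ also. Let $r = s \times t$, and $u = t' \times s'$. To prove the theorem, it is equivalent to show that

$$r_{ij} = u_{ji}.$$

We have

$$r_{ij} = \bigvee_{h=1}^{p} s_{ih} \times t_{hj}.$$

The elements of the j-th row of $t'$ are $t_{1j}, t_{2j}, \ldots, t_{pj}$ and the elements of the i-th column of $s'$ are $s_{i1}, s_{i2}, \ldots, s_{ip}$. Thus, the element in the j-th row and i-th column of $u = t' \times s'$ is

$$u_{ji} = \bigvee_{h=1}^{p} t_{hj} \times s_{ih}.$$

But

$$u_{ji} = \bigvee_{h=1}^{p} t_{hj} \times s_{ih} = \bigvee_{h=1}^{p} s_{ih} \times t_{hj} = r_{ij}.$$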

Q.E.D. 
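Theorem 5.1 can be spot-checked on small matrices. The sketch below assumes the additive case, where the scalar product $\times$ is ordinary addition and the matrix product takes a maximum over sums; `maxplus` and `transpose` are hypothetical helper names, and matrices are plain nested lists.

```python
NEG_INF = float("-inf")

def maxplus(s, t):
    """Minimax (max-plus) matrix product: (s x t)[i][j] = max over h of s[i][h] + t[h][j]."""
    return [[max(s[i][h] + t[h][j] for h in range(len(t)))
             for j in range(len(t[0]))]
            for i in range(len(s))]

def transpose(m):
    """Ordinary matrix transpose."""
    return [list(col) for col in zip(*m)]

s = [[0, 2, NEG_INF],
     [1, NEG_INF, 3]]                 # 2 x 3 example (assumed data)
t = [[NEG_INF, 4],
     [0, 1],
     [2, NEG_INF]]                    # 3 x 2 example (assumed data)

left = transpose(maxplus(s, t))
right = maxplus(transpose(t), transpose(s))
```

Here `left` and `right` coincide, as the theorem states; the check relies only on the commutativity of addition.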


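Theorem 5.2. $a \times' b = (b^* \times a^*)^*$, where a and b are matrices of compatible sizes with values in either bounded l-group.
Proof: For matrices of the appropriate sizes, we know

$$(f \times g)^* = g^* \times' f^*,$$

as $\times'$ is conjugate to $\times$ [39]. Thus, setting $g = a^*$ and $f = b^*$, we have

$$(b^* \times a^*)^* = a \times' b.$$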

Q.E.D. 

Let $V = \{ 1, 2, \ldots, n \}$, and let $W: V \to 2^V$ be any function with the property $i \in W(i)$ for all i. Under the induced action of the isomorphism $\Psi^{-1}$ as discussed in Chapter 2, W represents the neighbors of processor $x_i$ with whom processor $x_i$ can communicate. That is, $\Psi^{-1}(W(i))$ represents those processors in a network whose memories processor $x_i$ can directly access, which includes itself (as $i \in W(i)$). We shall call W a configuration function on V.

An example of a configuration function is the von Neumann configuration function. See Figure 1(a). As before, we assume our coordinate set X is an r x s array, rs = n. Fix $i \in V$. Then $i = k \cdot s + p$ for some non-negative integers k, p where $0 \le p < s$ (by the division algorithm for the integers). Let $N(i) = \{ i-1,\, i,\, i+1,\, i-s,\, i+s \}$, for i = 1,...,n. Then the von Neumann configuration function W is defined by $W(i) = \{ j \in N(i) : j \in V$, and j satisfies one of the 4 conditions: (1) $j = i$; (2) $j = h \cdot s + p$, where $h = k-1$ or $h = k+1$; (3) if $p \neq 0$, then $j = k \cdot s + p + 1$; or (4) if $p \neq 1$, then $j = k \cdot s + p - 1 \}$. This formulation takes care of the truncated von Neumann neighborhood on the boundary pixels of X, as well as of the pixels in the interior of X.
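A configuration function of this kind is easy to tabulate. The sketch below is one possible encoding and not the thesis's own formulation: it numbers the cells of an r x s array 1,...,n in row-major order and works with explicit row and column indices rather than with the quotient and remainder k, p. The function name `von_neumann` is hypothetical.

```python
def von_neumann(r, s):
    """Return W as a dict mapping each index i in 1..r*s to the set W(i).

    Cells are numbered row by row; W(i) holds i itself plus whichever of the
    four nearest neighbors exist, so boundary neighborhoods are truncated.
    """
    W = {}
    for i in range(1, r * s + 1):
        row, col = divmod(i - 1, s)
        nbrs = {i}
        if row > 0:
            nbrs.add(i - s)           # neighbor above
        if row < r - 1:
            nbrs.add(i + s)           # neighbor below
        if col > 0:
            nbrs.add(i - 1)           # neighbor to the left
        if col < s - 1:
            nbrs.add(i + 1)           # neighbor to the right
        W[i] = nbrs
    return W

W = von_neumann(3, 3)
```

For the 3 x 3 array, the corner cell 1 can access cells 1, 2, and 4, while the center cell 5 can access 2, 4, 5, 6, and 8.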

We  now  state  some  preliminary  definitions  and  concepts,  making  use  of  the  graph 
theory  developed  in  section  3.3.3. 

Define $S_{-\infty}(t_i) = \{ j \in V : t_{ij} \neq -\infty \}$. The set $S_{-\infty}(t_i)$ is called the support of $t_i$, in accordance with the image algebra definition of the support of a template at location $x_i$. In fact, under the isomorphism, $S_{-\infty}(t_i)$ corresponds to the support of the associated template at $x_i$, for all i. Let W be a configuration function, and let $t \in \mathcal{M}_{nn}$.

We say t is local with respect to W if $S_{-\infty}(t_i) \subset W(i)$ for all $i \in V$. If W is understood we say that t is local. A decomposition of t is a set of matrices $\{t(i)\}_{i=1}^{j}$ such that $t = \mathop{\times}_{i=1}^{j} t(i)$. The set $\{t(i)\}_{i=1}^{j}$ is a local decomposition of $t \in \mathcal{M}_{nn}$ with respect to W if t(i) is local with respect to W for all i = 1,...,j. As before, if W is understood, we say simply that $\{t(i)\}_{i=1}^{j}$ is a local decomposition of t. A weak decomposition is one in which $\vee$ occurs as a template operation. A weak local decomposition is a local decomposition that is weak.


Let D = {V,E} be a digraph with $u, v \in V$. We say that v is reachable from u if there exists a path from u to v in D. If u is reachable from v and v is reachable from u, then we say that the pair (u,v) or (v,u) is mutually reachable in D. Note that in a graph G, reachable and mutually reachable are equivalent. A digraph or graph is strongly connected if for all pairs $(u,v) \in V \times V$, u is reachable from v.

We now present the correspondence between template configurations and digraphs.

For every configuration function W, we can associate a graph and a digraph in the following way. For $i \in V$, let $E_i = \{ (j,i) : j \in W(i) \}$, and let $F_i = \{ (i,j) : (j,i) \in E_i \}$. The digraph of W, denoted by D(W), is the digraph {V,E} where $E = \bigcup_{i=1}^{n} E_i$. The graph of W is denoted by G(W), and is defined to be G(W) = {V,E}, where $E = \bigcup_{i=1}^{n} E_i \cup \bigcup_{i=1}^{n} F_i$.

We remark that if W is the von Neumann or the Moore configuration function, then G(W) is strongly connected. These are common types of neighborhood connection schemes used on parallel architectures.
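The construction of D(W) and G(W), and the strong connectivity test, can be sketched as follows. This is an illustrative encoding only: W is assumed to be a dictionary from indices to sets, edges are ordered pairs, and the helper names are hypothetical.

```python
from collections import deque

def digraph_edges(W):
    """Edge set of D(W): the union over i of E_i = {(j, i) : j in W(i)}."""
    return {(j, i) for i, nbrs in W.items() for j in nbrs}

def graph_edges(W):
    """Edge set of G(W): the edges of D(W) together with their reversals F_i."""
    E = digraph_edges(W)
    return E | {(i, j) for (j, i) in E}

def reachable(start, edges):
    """Set of nodes reachable from start by following directed edges (breadth-first)."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
    seen, queue = {start}, deque([start])
    while queue:
        u = queue.popleft()
        for v in adj.get(u, ()):
            if v not in seen:
                seen.add(v)
                queue.append(v)
    return seen

def strongly_connected(V, edges):
    """True when every node reaches every other node along directed edges."""
    V = set(V)
    root = next(iter(V))
    reverse = {(v, u) for u, v in edges}
    return reachable(root, edges) == V and reachable(root, reverse) == V

W_chain = {1: {1, 2}, 2: {2, 3}, 3: {3}}   # one-way chain (assumed example)
W_split = {1: {1}, 2: {2}}                 # two isolated processors (assumed example)
```

For `W_chain`, the digraph D(W) is not strongly connected but the symmetrized graph G(W) is; for `W_split`, neither is.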

Now, we establish the correspondence between our weighted graph G(W) and a matrix. Recall that in Chapter 3 we described a one-one correspondence between a graph and a template $t \in (F^X_{\pm\infty})^X$. We use the isomorphism to map a template to a matrix, and define the weighted graph accordingly. Specifically, for a matrix $t = (t_{ij})$, we define the corresponding graph A(t) so that A(t) has edge weight $t_{ji}$ for the edge (i,j). Let W be a configuration function, and G(W) its graph. Then any matrix t associated with G(W) must satisfy $t_{ji} = -\infty$ if (i,j) is not an edge in G(W). We will use this fact in determining local decompositions for an arbitrary matrix t. The reason for the exchange of indices in this correspondence now becomes clear. If $(j,i) \in E$ then $j \in W(i)$, and local matrices must satisfy the condition that $S_{-\infty}(t_i) \subset W(i)$. Thus, for a given $i \in V$, $\{ j \in V : j \in W(i) \}$ represents the possible indices j where $t_{ij}$ need not have value $-\infty$, or, equivalently, the possible processors $x_j$ who can communicate with processor $x_i$.

5.1.2. Local Decompositions

A matrix decomposition of t is a set of matrices t(1),..., t(j) such that $t = t(1) \times t(2) \times \cdots \times t(j)$. The t(i) are called the factors of the decomposition. We write $t = \mathop{\times}_{i=1}^{j} t(i)$ to indicate a decomposition of t.

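Lemma 5.3. Suppose that $s \times t = r$ is a decomposition of r. Then this decomposition is not unique, and $\bar{s} \times \bar{t}$ is also a decomposition of r, with

$$\bar{s} = \lambda \times s, \qquad \bar{t} = \lambda^{-1} \times t$$

and $\lambda \in F$ arbitrary.

Proof: Suppose that $s \times t$ is a decomposition of r, and let $\lambda \in F$ be arbitrary. Then $\lambda \times s = s \times \lambda$, as (in our case) F is commutative. This implies that

$$(\lambda \times s) \times (\lambda^{-1} \times t) = (s \times \lambda) \times (\lambda^{-1} \times t) = s \times (\lambda \times \lambda^{-1}) \times t = s \times \phi \times t = s \times t = r.$$

Thus, setting $\bar{s} = \lambda \times s$ and $\bar{t} = \lambda^{-1} \times t$, we see that $\bar{s} \times \bar{t} = r$ also.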

Q.E.D. 
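The non-uniqueness in Lemma 5.3 is easy to see numerically. The sketch assumes the additive case, so multiplying a matrix by the scalar $\lambda$ means adding $\lambda$ to every entry and $\lambda^{-1}$ is $-\lambda$; `maxplus` and `scale` are hypothetical helper names.

```python
NEG_INF = float("-inf")

def maxplus(s, t):
    """Max-plus matrix product: (s x t)[i][j] = max over h of s[i][h] + t[h][j]."""
    return [[max(s[i][h] + t[h][j] for h in range(len(t)))
             for j in range(len(t[0]))]
            for i in range(len(s))]

def scale(lam, m):
    """Scalar multiple lam x m; in the additive case this adds lam to each entry."""
    return [[lam + x for x in row] for row in m]

s = [[0, NEG_INF],
     [2, 1]]                          # assumed example factor
t = [[1, 3],
     [NEG_INF, 0]]                    # assumed example factor

r = maxplus(s, t)
r_rescaled = maxplus(scale(5, s), scale(-5, t))
```

Both products give the same matrix r, so the pair (5 x s, -5 x t) is a second decomposition of r.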

Lemma 5.4. Let $t \in \mathcal{M}_{nn}$ be such that $t_{ii} \in F$ for all i = 1,...,n. Then t is equivalent to a matrix s which has the property that $s_{ii} = \phi$ for all i = 1,...,n.

Proof: Let $d = \mathrm{diag}(t_{11}^{-1}, \ldots, t_{nn}^{-1})$. Note that d is invertible and $d^{-1} = \mathrm{diag}(t_{11}, t_{22}, \ldots, t_{nn})$. Defining $s = d \times t$, we find that in computing $s_{ii}$,

$$s_{ii} = \bigvee_{k=1}^{n} d_{ik} \times t_{ki} = d_{ii} \times t_{ii} = t_{ii}^{-1} \times t_{ii} = \phi.$$

Hence we have

$$s = d \times t = d \times t \times e,$$

which implies, since both d and e are invertible, that

$$t = d^{-1} \times s = d^{-1} \times s \times e$$

for the n x n identity matrix e, and, thus, t is equivalent to s, which has the required form.

Q.E.D. 
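The diagonal normalization in Lemma 5.4 can be sketched in the additive case, where $\phi = 0$ and the inverse of a finite entry is its negative. The helper names `maxplus` and `diag` are hypothetical, and the example matrix is assumed data.

```python
NEG_INF = float("-inf")

def maxplus(s, t):
    """Max-plus matrix product: (s x t)[i][j] = max over h of s[i][h] + t[h][j]."""
    return [[max(s[i][h] + t[h][j] for h in range(len(t)))
             for j in range(len(t[0]))]
            for i in range(len(s))]

def diag(values):
    """Diagonal matrix with the given entries on the diagonal and -inf elsewhere."""
    n = len(values)
    return [[values[i] if i == j else NEG_INF for j in range(n)] for i in range(n)]

t = [[2, NEG_INF, 5],
     [1, -3, NEG_INF],
     [0, 4, 7]]                       # finite diagonal entries, as the lemma requires

d = diag([-t[i][i] for i in range(3)])        # d = diag(t_11^{-1}, ..., t_nn^{-1})
d_inv = diag([t[i][i] for i in range(3)])     # d^{-1} = diag(t_11, ..., t_nn)

s = maxplus(d, t)                     # s has the identity element 0 on its diagonal
t_back = maxplus(d_inv, s)            # recovers t
```

Multiplying by d on the left shifts row i by $-t_{ii}$, which is why the diagonal of s becomes 0 and why multiplying by $d^{-1}$ undoes the shift.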

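A matrix $t \in \mathcal{M}_{nn}$ satisfying $t_{ii} = \phi$ for all i = 1,...,n is called a $\phi$-diagonal matrix, or $\phi$-diagonal, for convenience of notation.

Since every matrix t such that $t_{ii} \in F$ is equivalent to one which has $\phi$'s on the diagonal, we may use this to our advantage and prove our theorems for this special type of matrix if it is easier to do, and use the property of equivalence to show that the theorems hold in the more general case.

A square matrix $t \in \mathcal{M}_{nn}$ is said to be lower triangular if it satisfies

$$t_{ij} = -\infty \text{ if } i < j$$

and upper triangular if it satisfies

$$t_{ij} = -\infty \text{ if } i > j.$$

Lemma 5.5. Let $t \in \mathcal{M}_{nn}$ be $\phi$-diagonal. Then t has a weak decomposition into lower and upper triangular matrices. In particular, t can be written as

$$t = l \vee u,$$

where l (u) is lower (upper) triangular, and $l_{ii} = u_{ii} = \phi$.

Proof: Define $l, u \in \mathcal{M}_{nn}$ by

$$l_{ij} = \begin{cases} t_{ij} & \text{if } i \ge j \\ -\infty & \text{otherwise,} \end{cases} \qquad u_{ij} = \begin{cases} t_{ij} & \text{if } i \le j \\ -\infty & \text{otherwise.} \end{cases}$$

Then l is lower triangular, u is upper triangular, and $l \vee u = t$. This is because

$$[\, l \vee u \,]_{ij} = l_{ij} \vee u_{ij} = \begin{cases} -\infty \vee t_{ij} & \text{if } i < j \\ t_{ij} \vee -\infty & \text{if } i > j \\ t_{ii} \vee t_{ii} = \phi & \text{if } i = j, \end{cases}$$

which shows that $l \vee u = t$.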


Q.E.D. 
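The weak decomposition of Lemma 5.5 amounts to splitting a matrix along its diagonal and recombining with an entrywise maximum. A minimal sketch in the additive case ($\phi = 0$), with hypothetical helper names and assumed example data:

```python
NEG_INF = float("-inf")

def split_triangular(t):
    """Return (l, u): l keeps the entries with i >= j, u keeps those with i <= j."""
    n = len(t)
    lower = [[t[i][j] if i >= j else NEG_INF for j in range(n)] for i in range(n)]
    upper = [[t[i][j] if i <= j else NEG_INF for j in range(n)] for i in range(n)]
    return lower, upper

def join(a, b):
    """Entrywise maximum of two matrices, the operation written as a V b."""
    return [[max(x, y) for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]

t = [[0, 4, -2],
     [1, 0, NEG_INF],
     [3, -5, 0]]                      # a 0-diagonal matrix (assumed example)

l, u = split_triangular(t)
t_joined = join(l, u)
```

The diagonal appears in both pieces, so only a single use of the join is needed to rebuild t.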

Corollary 5.6. Let $t \in \mathcal{M}_{nn}$ be lower or upper triangular with the property that $t_{ii} \in F$ for all i. Then t is equivalent to a matrix which is $\phi$-diagonal.

An off matrix $b \in \mathcal{M}_{nn}$ is a matrix which satisfies: $b_{ii} \in F$ for all i = 1,...,n, and $b_{ij} = -\infty$ if $i \neq j$, except for a unique index pair $(i',j')$ such that $b_{i'j'} \in F$.

If $t \in \mathcal{M}_{mn}$, then we use the notation $t_i$ to denote the i-th row of t, and the notation $t^j$ to denote the j-th column of t. If a is a vector with n entries, then $a_i$ denotes the i-th entry of the vector a. If $t \in \mathcal{M}_{mn}$, then $t_{ij}$ denotes the entry at location (i,j). Thus, for $t \in \mathcal{M}_{mn}$, $(t_i)^j = t_{ij}$, and it is trivial to show

Proposition 5.7. $(s \times t)_i^j = s_i \times t^j$.

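Let $l \in \mathcal{M}_{nn}$ be lower triangular with all diagonal entries equal to $\phi$. For k = 1,...,n, define

$${}^{k}c = e \vee ( l^k \times (e^k)' ).$$

Thus, ${}^{k}c$ has the form of the identity matrix e with its k-th column replaced by the k-th column of l:

$$({}^{k}c)_{ij} = \begin{cases} \phi & \text{if } i = j \\ l_{ik} & \text{if } j = k \text{ and } i \ge k+1 \\ -\infty & \text{otherwise.} \end{cases}$$

Lemma 5.8. If $l \in \mathcal{M}_{nn}$ is lower triangular and $\phi$-diagonal, then

$$l = {}^{1}c \times {}^{2}c \times \cdots \times {}^{n-1}c.$$

Proof: By induction we show that $s = {}^{1}c \times {}^{2}c \times \cdots \times {}^{k}c$ has form

$$s = [\; l^1 \;\; l^2 \;\; \cdots \;\; l^k \;\; e^{k+1} \;\; \cdots \;\; e^n \;] \qquad (5\text{-}1)$$

for $1 \le k \le n-1$, that is, the first k columns of s are those of l and the remaining columns are those of e. It is easily shown that

$${}^{1}c \times {}^{2}c = [\; l^1 \;\; l^2 \;\; e^3 \;\; \cdots \;\; e^n \;].$$

We assume that $s = {}^{1}c \times {}^{2}c \times \cdots \times {}^{k}c$ has form as in (5-1) for $1 \le k \le n-2$, and consider $s \times {}^{k+1}c$.

Case 1. $j \neq k+1$. $(s \times {}^{k+1}c)^j = s \times ({}^{k+1}c)^j = s \times e^j$, as $({}^{k+1}c)^j = e^j$ for $j \neq k+1$. Continuing,

$$s \times e^j = s^j = \begin{cases} l^j & 1 \le j \le k \\ e^j & k+1 < j \le n. \end{cases}$$

Case 2. $j = k+1$. $(s \times {}^{k+1}c)_i^{k+1} = s_i \times ({}^{k+1}c)^{k+1} = \bigvee_{h=1}^{n} s_{ih} \times {}^{k+1}c_{h,k+1}$.

If $i < k+1$, then

$$\bigvee_{h=1}^{n} s_{ih} \times {}^{k+1}c_{h,k+1} = \Big[ \bigvee_{h=1}^{i} s_{ih} \times {}^{k+1}c_{h,k+1} \Big] \vee \Big[ \bigvee_{h=i+1}^{k+1} s_{ih} \times {}^{k+1}c_{h,k+1} \Big] \vee \Big[ \bigvee_{h=k+2}^{n} s_{ih} \times {}^{k+1}c_{h,k+1} \Big]$$

$$= \Big[ \bigvee_{h=1}^{i} l_{ih} \times -\infty \Big] \vee \Big[ \bigvee_{h=i+1}^{k+1} -\infty \times {}^{k+1}c_{h,k+1} \Big] \vee \Big[ \bigvee_{h=k+2}^{n} -\infty \times {}^{k+1}c_{h,k+1} \Big] = -\infty.$$

If $i = k+1$, then

$$\bigvee_{h=1}^{n} s_{k+1,h} \times {}^{k+1}c_{h,k+1} = \Big[ \bigvee_{h=1}^{k} s_{k+1,h} \times {}^{k+1}c_{h,k+1} \Big] \vee \big[\, s_{k+1,k+1} \times {}^{k+1}c_{k+1,k+1} \,\big] \vee \Big[ \bigvee_{h=k+2}^{n} s_{k+1,h} \times {}^{k+1}c_{h,k+1} \Big]$$

$$= \Big[ \bigvee_{h=1}^{k} l_{k+1,h} \times -\infty \Big] \vee [\, \phi \times \phi \,] \vee \Big[ \bigvee_{h=k+2}^{n} -\infty \times {}^{k+1}c_{h,k+1} \Big] = \phi.$$

If $i > k+1$, then

$$\bigvee_{h=1}^{n} s_{ih} \times {}^{k+1}c_{h,k+1} = \Big[ \bigvee_{h=1}^{k} s_{ih} \times {}^{k+1}c_{h,k+1} \Big] \vee \Big[ \bigvee_{h=k+1}^{i-1} s_{ih} \times {}^{k+1}c_{h,k+1} \Big] \vee \big[\, s_{ii} \times {}^{k+1}c_{i,k+1} \,\big] \vee \Big[ \bigvee_{h=i+1}^{n} s_{ih} \times {}^{k+1}c_{h,k+1} \Big]$$

$$= \Big[ \bigvee_{h=1}^{k} l_{ih} \times -\infty \Big] \vee \Big[ \bigvee_{h=k+1}^{i-1} -\infty \times {}^{k+1}c_{h,k+1} \Big] \vee [\, \phi \times l_{i,k+1} \,] \vee \Big[ \bigvee_{h=i+1}^{n} -\infty \times {}^{k+1}c_{h,k+1} \Big] = l_{i,k+1}.$$

Thus, for j = k+1, we have

$$(s \times {}^{k+1}c)^{k+1} = ( -\infty, \ldots, -\infty, \phi, l_{k+2,k+1}, \ldots, l_{n,k+1} )' = l^{k+1},$$

and hence,

$$s \times {}^{k+1}c = [\; l^1 \;\; l^2 \;\; \cdots \;\; l^{k+1} \;\; e^{k+2} \;\; \cdots \;\; e^n \;].$$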


Q.E.D. 
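The factorization of Lemma 5.8 can be verified on a small example. The sketch below works in the additive case ($\phi = 0$) with 0-based indices, so the factor called `column_factor(l, k)` plays the role of the (k+1)-th column matrix of the lemma; all helper names are hypothetical.

```python
NEG_INF = float("-inf")

def maxplus(s, t):
    """Max-plus matrix product: (s x t)[i][j] = max over h of s[i][h] + t[h][j]."""
    return [[max(s[i][h] + t[h][j] for h in range(len(t)))
             for j in range(len(t[0]))]
            for i in range(len(s))]

def identity(n):
    """Max-plus identity: 0 on the diagonal, -inf elsewhere."""
    return [[0 if i == j else NEG_INF for j in range(n)] for i in range(n)]

def column_factor(l, k):
    """Identity matrix whose k-th column (0-based) is replaced by the k-th column of l."""
    n = len(l)
    c = identity(n)
    for i in range(k, n):
        c[i][k] = l[i][k]
    return c

l = [[0, NEG_INF, NEG_INF],
     [3, 0, NEG_INF],
     [-1, 2, 0]]                      # lower triangular, 0-diagonal (assumed example)

product = identity(3)
for k in range(2):                    # factors for columns 1, ..., n-1, in that order
    product = maxplus(product, column_factor(l, k))

reversed_product = maxplus(column_factor(l, 1), column_factor(l, 0))
```

The ordered product reproduces l. The reversed order does not in general: here it yields 5 instead of -1 in the bottom-left entry, because the path through column 2 then contributes to column 1.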


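Lemma 5.9. Let $l \in \mathcal{M}_{nn}$ be lower triangular and $\phi$-diagonal, and let ${}^{k}c$ be as above, k = 1,...,n. Then

$${}^{k}c = {}^{n,k}c \times {}^{n-1,k}c \times \cdots \times {}^{k+1,k}c,$$

where, for $i \ge k+1$, ${}^{i,k}c$ is the identity matrix with the single additional entry $l_{ik}$ at location (i,k):

$$({}^{i,k}c)_{mh} = \begin{cases} \phi & \text{if } m = h \\ l_{ik} & \text{if } (m,h) = (i,k) \\ -\infty & \text{otherwise.} \end{cases}$$

Proof: We use induction to show that ${}^{k}c = {}^{n,k}c \times {}^{n-1,k}c \times \cdots \times {}^{k+1,k}c$ has the required form. It is easily shown that ${}^{k+2,k}c \times {}^{k+1,k}c$ is the identity matrix with the additional entries $l_{k+1,k}$ and $l_{k+2,k}$ in column k.

Let $s = {}^{i,k}c \times {}^{i-1,k}c \times \cdots \times {}^{k+1,k}c$ for $i \le n-1$. Here, we assume

$$s^j = \begin{cases} e^j & \text{if } j \neq k \\ ( -\infty, \ldots, -\infty, \phi, l_{k+1,k}, l_{k+2,k}, \ldots, l_{ik}, -\infty, \ldots, -\infty )' & \text{if } j = k. \end{cases}$$

Then the j-th column of ${}^{i+1,k}c \times s$ is ${}^{i+1,k}c \times s^j$.

Case 1. $j \neq k$. ${}^{i+1,k}c \times s^j = {}^{i+1,k}c \times e^j = ({}^{i+1,k}c)^j = e^j$.

Case 2. $j = k$.

$$({}^{i+1,k}c)_m \times s^k = \bigvee_{h=1}^{n} {}^{i+1,k}c_{mh} \times s_{hk}. \qquad (5\text{-}2)$$

If $m < k$, then (5-2) is $({}^{i+1,k}c)_m \times s^k = e_m \times s^k = -\infty$.

For $k \le m \le i$, equation (5-2) gives us

$$({}^{i+1,k}c)_m \times s^k = e_m \times s^k = \begin{cases} \phi & \text{if } m = k \\ l_{mk} & \text{if } m = k+1,\ldots,i. \end{cases}$$

If $m = i+1$, then $({}^{i+1,k}c)_{i+1} \times s^k = l_{i+1,k}$.

If $m > i+1$, then $({}^{i+1,k}c)_m \times s^k = -\infty$.

Thus, the k-th column of ${}^{i+1,k}c \times s$ has form

$$( -\infty, \ldots, -\infty, \phi, l_{k+1,k}, \ldots, l_{i+1,k}, -\infty, \ldots, -\infty )'.$$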

Q.E.D. 

We  now  state  the  main  result  of  this  section. 

Theorem 5.10. Let $t \in \mathcal{M}_{nn}$ be a doubly F-astic matrix with $t_{ii} \in F$ for all i = 1,...,n, and W an arbitrary configuration function. Then t has a local weak decomposition if and only if G(W) is strongly connected. Furthermore, there is at most one weak operation of $\vee$.

We prove Theorem 5.10 in several steps. First we show that strong connectivity is a necessary condition. Then we derive a general decomposition method for a matrix t, and show how elementary matrices play a crucial role in determining that strong connectivity is sufficient. In the proofs we make no distinction between matrices with values in either belt, the additive one or the multiplicative one, and hence between the operations of + or *, both denoted by the symbol $\times$. This is because we will use the isomorphisms to show that the results hold for templates under the operation $\boxed{\vee}$ as well as for templates under the operation $\ovee$.


5.1.3. Necessary and Sufficient Conditions

Our first theorem shows the necessity.

Theorem 5.11. If every $t \in \mathcal{M}_{nn}$ has a local decomposition with respect to W, then G(W) is strongly connected.

Proof: We assume by contradiction that G(W) is not strongly connected, but every $t \in \mathcal{M}_{nn}$ has a local decomposition with respect to W. Let $i, j \in V$ be such that i is not reachable from j. Let

$$D(1) = \{ k \in V : i \text{ is reachable from } k \}$$

$$D(2) = \{ m \in V : m \text{ is reachable from } j \}.$$

Note that $D(1) \cap D(2) = \emptyset$, for otherwise $z \in D(1) \cap D(2)$ implies that i is reachable from z and z is reachable from j, giving i is reachable from j. Let t be a matrix which is local with respect to W.

First, we show that $t_{km} = -\infty$ if $k \in D(1)$ and $m \in D(2)$. Suppose by way of contradiction that $k \in D(1)$ and $m \in D(2)$ with $t_{km} \neq -\infty$. Then since t is local with respect to W, $m \in S_{-\infty}(t_k)$ implies that $m \in W(k)$. Thus, (m,k) is an edge and hence k is reachable from m. However, i is reachable from k, so i is reachable from m also. Thus, $m \in D(1)$, which is a contradiction, as m is also in D(2). Thus, $t_{km} = -\infty$ for $k \in D(1)$ and $m \in D(2)$.

We now give a matrix which cannot have a local decomposition with respect to W. Define $t \in \mathcal{M}_{nn}$ and a vector v with n entries by

$$t_{km} = \begin{cases} \phi & \text{if } (k,m) = (i,j) \\ -\infty & \text{otherwise,} \end{cases} \qquad v_k = \begin{cases} \phi & \text{if } k = j \\ -\infty & \text{otherwise.} \end{cases}$$

By assumption, there exists a local decomposition of t, $t = \mathop{\times}_{h=1}^{r} t^h$. At location k, the vector $b = t \times v$ has value

$$b_k = \bigvee_{h=1}^{n} t_{kh} \times v_h = t_{kj} \times v_j = \begin{cases} \phi & \text{if } k = i \\ -\infty & \text{if } k \neq i. \end{cases}$$

Let $p \in \{1, \ldots, r\}$. We show by induction on p that for $c(p) = (\mathop{\times}_{h=1}^{p} t^h) \times v$, we have $c_u(p) = -\infty$ for every index $u \in D(1)$. If this is not true for p = 1, then there exists an index $u \in D(1)$ such that $c_u(1) \in F$, as $\bigvee_{k=1}^{n} t^1_{uk} \times v_k = c_u(1)$. (Note that any $t^h$ cannot have any $+\infty$ values, as then t would have $+\infty$ values.) Thus, there must exist a pair (u,m) such that $t^1_{um} \in F$ and $v_m \in F$, which implies that m = j, which implies that $m \in D(2)$. However, $m \in D(2)$ implies that $t^1_{um} = -\infty$ (as $u \in D(1)$), which is a contradiction. Thus, the claim must be true for c(1). Now for some $p \in \{2, \ldots, r\}$, by induction assume the claim is true for c(1),...,c(p-1), and that it is not true for c(p). Then there exists a $u \in D(1)$ such that $c_u(p) \in F$. Therefore there must exist an index m such that $t^p_{um} \in F$ and $c_m(p-1) \in F$. By the induction hypothesis, $m \notin D(1)$. Since $t^p$ is local, we know that $m \in S_{-\infty}(t^p_u) \subset W(u)$ implies that u is reachable from m. Also, since $u \in D(1)$, i is reachable from u, so $m \in D(1)$, which is a contradiction to the induction hypothesis. Therefore the claim that for $c(p) = (\mathop{\times}_{h=1}^{p} t^h) \times v$, we have $c_u(p) = -\infty$ for every $u \in D(1)$ is true. In particular, it must be true for $c = c(r) = t \times v$. Thus, $c_i(r) = b_i = -\infty$, which contradicts our construction of c. Hence the proof is complete.


Q.E.D. 




It can be easily shown that the set of all strictly doubly $\phi$-astic n x n matrices with entries in $\{ -\infty, \phi \}$ is isomorphic to the symmetric group $S_n$ [39]. Since every permutation $\sigma$ can be factored into a product of transpositions, every permutation matrix can be factored into a product of matrices corresponding to transpositions. This leads to our next definition of exchange matrices.

The exchange matrix $p^{ij} \in \mathcal{M}_{nn}$ is the matrix defined by

$$p^{ij}_{km} = \begin{cases} \phi & \text{if } k = m \text{ and } k \neq i,\ k \neq j, \text{ or if } (k,m) = (i,j) \text{ or } (k,m) = (j,i) \\ -\infty & \text{otherwise.} \end{cases}$$

The matrix $p^{ij}$ corresponds to the transposition $\sigma = (i,j) \in S_n$. Note that $p^{ij} \times a = b$, where

$$b_k = \begin{cases} a_i & \text{if } k = j \\ a_j & \text{if } k = i \\ a_k & \text{otherwise} \end{cases} \qquad \text{for } k = 1,\ldots,n.$$

Lemma 5.12. Let $i, j \in V$, $i \neq j$, let (i,j) denote a transposition in $S_n$, and let W be a configuration function. Suppose there exists an i - j path in G(W),

$$i = k_0, k_1, \ldots, k_m = j.$$

Then the exchange matrix $p^{ij}$ can be written as

$$p^{ij} = p^{i k_1} \times p^{k_1 k_2} \times \cdots \times p^{k_{m-1} j} \times p^{k_{m-2} k_{m-1}} \times \cdots \times p^{k_1 k_2} \times p^{i k_1}.$$

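Proof: Note that $(i,j) \in S_n$ can be written as

$$(i,j) = (i\,k_1) \circ (k_1 k_2) \circ \cdots \circ (k_{m-1}\,j) \circ (k_{m-2} k_{m-1}) \circ \cdots \circ (k_1 k_2) \circ (i\,k_1).$$

In the cyclic notation for the permutation $\sigma = (i,j)$, let $i \to j$ denote i goes to j. Then we have, for the above permutation, $i \to k_1$, $k_1 \to k_2$, ..., $k_{m-1} \to j$, which implies that $i \to j$. We also have $k_1 \to i \to k_1$; $k_2 \to k_1 \to k_2$; ...; $k_{m-1} \to k_{m-2} \to k_{m-1}$; $j \to k_{m-1} \to k_{m-2} \to \cdots \to k_1 \to i$, which implies that $k_l \to k_l$ for $l \neq 0, m$, and $j \to i$. We now use the fact that $S_n$ is isomorphic to the set of all strictly doubly $\phi$-astic matrices, where the permutation $\sigma = (i_1, i_2, \ldots, i_m)$ corresponds to the permutation matrix $p^{\sigma}$ defined by

$$p^{\sigma}_{hk} = \begin{cases} \phi & \text{if } (h,k) = (i_r, i_{r+1}),\ r = 1,\ldots,m-1, \text{ or if } (h,k) = (i_m, i_1) \\ -\infty & \text{otherwise} \end{cases}$$

and the product of two permutations $\sigma_1 \circ \sigma_2$ corresponds to the matrix product $p^{\sigma_1} \times p^{\sigma_2}$. Thus, for $\sigma = (i,j) = (i\,k_1) \circ (k_1 k_2) \circ \cdots \circ (k_{m-1}\,j) \circ (k_{m-2} k_{m-1}) \circ \cdots \circ (k_1 k_2) \circ (i\,k_1)$, we have

$$p^{ij} = p^{i k_1} \times p^{k_1 k_2} \times \cdots \times p^{k_{m-1} j} \times p^{k_{m-2} k_{m-1}} \times \cdots \times p^{k_1 k_2} \times p^{i k_1}.$$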

Q.E.D. 
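The path factorization of Lemma 5.12 can be checked directly. The sketch assumes the additive case ($\phi = 0$), 0-based node labels, and a path given as a list of nodes; `exchange` and `exchange_along_path` are hypothetical helper names.

```python
NEG_INF = float("-inf")

def maxplus(s, t):
    """Max-plus matrix product: (s x t)[i][j] = max over h of s[i][h] + t[h][j]."""
    return [[max(s[i][h] + t[h][j] for h in range(len(t)))
             for j in range(len(t[0]))]
            for i in range(len(s))]

def exchange(n, i, j):
    """Exchange matrix for the transposition (i, j): the identity with rows i and j swapped."""
    p = [[0 if a == b else NEG_INF for b in range(n)] for a in range(n)]
    p[i][i] = p[j][j] = NEG_INF
    p[i][j] = p[j][i] = 0
    return p

def exchange_along_path(n, path):
    """Factors and product of the exchange matrices along the path, out and back.

    For a path k_0, ..., k_m the factors are the exchanges of consecutive nodes
    from k_0 to k_m, followed by the same exchanges in reverse order with the
    last one omitted.
    """
    steps = [exchange(n, u, v) for u, v in zip(path, path[1:])]
    factors = steps + steps[-2::-1]
    product = factors[0]
    for f in factors[1:]:
        product = maxplus(product, f)
    return factors, product

factors, product = exchange_along_path(4, [0, 1, 2, 3])
```

For the path 0, 1, 2, 3 this uses five factors, each exchanging two adjacent nodes of the path, and their product equals the exchange matrix for the pair (0, 3).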


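Lemma 5.13. Let $i, j \in V$, $i \neq j$, and let $p^{ij}$ be the exchange matrix associated with i, j. Assume there exists an i - j path

$$i = k_0, k_1, \ldots, k_m = j.$$

Then the exchange matrix $p^{ij}$ has a local decomposition with respect to W.
Proof: If $i \in W(j)$ then we are done. Otherwise, use Lemma 5.12 and write

$$p^{ij} = p^{i k_1} \times p^{k_1 k_2} \times \cdots \times p^{k_{m-1} j} \times p^{k_{m-2} k_{m-1}} \times \cdots \times p^{k_1 k_2} \times p^{i k_1}.$$

Since $(k_h, k_{h+1})$ is an edge of the graph G(W) for all h = 0,...,m-1, we know that $k_h \in W(k_{h+1})$ and also that $k_{h+1} \in W(k_h)$ (as G(W) is a graph), for all h = 0,...,m-1. Also,

$$S_{-\infty}(p^{k_h k_{h+1}}_r) = \begin{cases} \{k_{h+1}\} & \text{if } r = k_h \\ \{k_h\} & \text{if } r = k_{h+1} \\ \{r\} & \text{otherwise} \end{cases}$$

and hence $S_{-\infty}(p^{k_h k_{h+1}}_r) \subset W(r)$ for all r = 1,...,n and for all h = 0,...,m-1. Thus $p^{k_h k_{h+1}}$ is local with respect to W for all h = 0,...,m-1 and hence

$$p^{ij} = p^{i k_1} \times p^{k_1 k_2} \times \cdots \times p^{k_{m-1} j} \times p^{k_{m-2} k_{m-1}} \times \cdots \times p^{k_1 k_2} \times p^{i k_1}$$

is a local decomposition of $p^{ij}$ with respect to W.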

Q.E.D. 
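The locality argument in the proof can be illustrated as follows. This is a sketch under stated assumptions: the configuration function W is represented as a dictionary of neighbour sets (each node also sees itself), the network is a hypothetical 4-node line 0-1-2-3, and `is_local` tests the row supports of a matrix against W.

```python
NEG = float("-inf")


def exchange_matrix(n, i, j):
    """Exchange matrix p^{ij} with phi = 0 (assumed)."""
    p = [[0.0 if r == c and r not in (i, j) else NEG for c in range(n)]
         for r in range(n)]
    p[i][j] = p[j][i] = 0.0
    return p


def is_local(m, W):
    """True if every non-null entry m[r][c] satisfies c in W[r]."""
    return all(c in W[r]
               for r, row in enumerate(m)
               for c, v in enumerate(row) if v != NEG)


# Hypothetical line network 0-1-2-3.
W = {0: {0, 1}, 1: {0, 1, 2}, 2: {1, 2, 3}, 3: {2, 3}}
path = [0, 1, 2, 3]
edge_factors = [exchange_matrix(4, u, v) for u, v in zip(path, path[1:])]

factors_are_local = all(is_local(f, W) for f in edge_factors)
direct_is_local = is_local(exchange_matrix(4, 0, 3), W)
```

The exchange of the two end nodes is not local on this network, while every factor along the path is.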

Exchange  matrices  play  an  important  role  in  the  decomposition  method,  as  we  shall  see 
shortly.  We  now  present  a  decomposition  method  for  lower  triangular  matrices. 

The matrices ${}^{k}c$ as described in Lemma 5.8 are fairly sparse, but certainly not local with respect to most architectures. Lemma 5.9 showed that each matrix ${}^{k}c$ can be written as a product of even sparser matrices, and in the next theorem we show that these very sparse matrices can be decomposed locally. The matrices ${}^{ik}c$ ($i > k$) are off matrices, and each is equivalent to a matrix which is local with respect to W, where G(W) is strongly connected.

Theorem 5.14. Let $b \in \mathcal{M}_{nn}$ be a lower triangular off matrix with off-diagonal entry $\beta$ at location (i,j). Let W be a configuration function such that G(W) is strongly connected.

Then b is equivalent to a matrix which is local with respect to W. Furthermore, b has a local decomposition with respect to W.

Proof: If $j \in W(i)$ then b is already local with respect to W. Otherwise, $W(i) \supset \{i\}$, and then we can find $k \in V$ such that $k \in W(i)$ ($k \ne i$), assuming without loss of generality that $k > i$. Let $q^{kj} = p^{kj}$. Then the matrix s defined by

\[ s = p^{kj} \times b \times q^{kj} \]

is a matrix which is local with respect to W. To see this, note that $b \times q^{kj}$ exchanges columns j and k of matrix b. Thus,

\[
(b \times q^{kj})_{hm} =
\begin{cases}
\phi & \text{if } h = m,\ (h,m) \ne (j,j) \text{ and } (h,m) \ne (k,k), \\
\phi & \text{if } (h,m) = (j,k) \text{ or } (h,m) = (k,j), \\
\beta & \text{if } (h,m) = (i,k), \\
-\infty & \text{otherwise.}
\end{cases}
\]

The matrix $s = p^{kj} \times (b \times q^{kj})$ exchanges rows j and k of matrix $b \times q^{kj}$, so

\[
\bigl(p^{kj} \times (b \times q^{kj})\bigr)_{hm} =
\begin{cases}
\phi & \text{if } h = m, \\
\beta & \text{if } (h,m) = (i,k), \\
-\infty & \text{otherwise.}
\end{cases}
\]

Note that $S_{-\infty}(s_i) = \{i,k\}$, and $S_{-\infty}(s_h) = \{h\}$ for $h \ne i$. Since $\{i,k\} \subset W(i)$, we have $S_{-\infty}(s_i) \subset W(i)$ and hence s is local with respect to W. Note also that s (in this case) is no longer lower triangular, as $k > i$.

A similar proof holds for the case $k < i$.

To show that b has a local decomposition, we write

\[ b = p^{kj} \times s \times q^{kj} , \]

as $(p^{h,m})^{-1} = p^{h,m}$ for all $h,m \in V$. Since G(W) is strongly connected, there exists a j-k path for all pairs $j,k \in V$, and hence Lemma 5.13 applies to $p^{kj}$. Thus $p^{kj}$ has a local decomposition, and since s is local and $q^{kj} = p^{kj}$, we have a local decomposition for b.

Q.E.D.
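The conjugation used in the proof can be checked on a concrete off matrix. The sketch below again assumes $\phi = 0$ and the max-plus product; the network, the location (i,j) = (3,0), the neighbour k = 2, and the value $\beta = 2.5$ are hypothetical choices (here k < i, the case the proof treats as similar). Indices are 0-based.

```python
NEG = float("-inf")


def exchange_matrix(n, i, j):
    p = [[0.0 if r == c and r not in (i, j) else NEG for c in range(n)]
         for r in range(n)]
    p[i][j] = p[j][i] = 0.0
    return p


def off_matrix(n, i, j, beta):
    """phi (= 0) on the diagonal, beta at (i, j), null elsewhere."""
    b = [[0.0 if r == c else NEG for c in range(n)] for r in range(n)]
    b[i][j] = beta
    return b


def maxplus_matmul(a, b):
    n = len(a)
    return [[max(a[r][k] + b[k][c] for k in range(n)) for c in range(n)]
            for r in range(n)]


def is_local(m, W):
    return all(c in W[r]
               for r, row in enumerate(m)
               for c, v in enumerate(row) if v != NEG)


W = {0: {0, 1}, 1: {0, 1, 2}, 2: {1, 2, 3}, 3: {2, 3}}   # line network 0-1-2-3
i, j, k, beta = 3, 0, 2, 2.5        # j is not in W[i], but k is
b = off_matrix(4, i, j, beta)
p = exchange_matrix(4, k, j)        # p^{kj}, which is its own inverse

s = maxplus_matmul(maxplus_matmul(p, b), p)      # s = p x b x p
b_back = maxplus_matmul(maxplus_matmul(p, s), p)  # b = p x s x p
```

The matrix s carries the off-entry at (i,k), which lies inside W(i); replacing each p by its local factors from Lemma 5.13 then gives a local decomposition of b.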

We  are  now  in  a  position  to  prove  the  following  general  theorem. 

Theorem 5.15. Let $l \in \mathcal{M}_{nn}$ be a lower triangular matrix with $l_{ii} \ne -\infty$ for all $i = 1,\ldots,n$ and W a configuration function such that G(W) is strongly connected. Then l has a local decomposition with respect to W.


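Proof: By Corollary 5.6, l is equivalent to a matrix c where $c_{ii} = \phi$ for all i;

\[ l = d \times c \times e = d \times c \]

where d is a diagonal matrix. By Lemma 5.8, c can be written as

\[ c = {}^{1}c \times {}^{2}c \times \cdots \times {}^{n-1}c . \]

By Lemma 5.9, each ${}^{k}c$ can be written as

\[ {}^{k}c = {}^{nk}c \times {}^{n-1,k}c \times \cdots \times {}^{k+1,k}c \]

where the ${}^{ik}c$ are as in the statement of Lemma 5.9. Since each ${}^{ik}c$ is an off matrix, we have, by Theorem 5.14, a local decomposition with respect to W for each ${}^{ik}c$:

\[ {}^{ik}c = \mathop{\times}_{j=1}^{m(i,k)} s(i,k,j) \]

where the s(i,k,j) are the factors of the decomposition, each s(i,k,j) is local with respect to W, and the number of factors m(i,k) depends on the values i and k. Thus,

\[
\begin{aligned}
l = d \times c &= d \times \bigl( {}^{1}c \times {}^{2}c \times \cdots \times {}^{n-1}c \bigr) \\
&= d \times \bigl[\, {}^{n1}c \times \cdots \times {}^{21}c \,\bigr] \times \bigl[\, {}^{n2}c \times \cdots \times {}^{32}c \,\bigr] \times \cdots \times \bigl[\, {}^{n,n-1}c \,\bigr] \\
&= d \times \Bigl[ \mathop{\times}_{j=1}^{m(n,1)} s(n,1,j) \times \cdots \times \mathop{\times}_{j=1}^{m(2,1)} s(2,1,j) \Bigr] \times \cdots \times \Bigl[ \mathop{\times}_{j=1}^{m(n,n-1)} s(n,n-1,j) \Bigr]
\end{aligned}
\]

where s(i,k,j) is local with respect to W for all i,k,j.

Q.E.D.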

Using the property that $(s \times t)' = t' \times s'$ as stated in Theorem 5.1, we can prove the same sequence of theorems for an upper triangular matrix u satisfying $u_{ii} \ne -\infty$ for all $i = 1,\ldots,n$. Thus we have

Theorem 5.16. Let $u \in \mathcal{M}_{nn}$ be an upper triangular matrix with $u_{ii} \in F$ for all i, and W a configuration function such that G(W) is strongly connected. Then u has a local decomposition with respect to W.

This  leads  immediately  to  the  main  theorem  of  this  chapter. 

Theorem 5.10. Let $t \in \mathcal{M}_{nn}$ be a doubly-F-astic matrix with $t_{ii} \in F$ for all $i = 1,\ldots,n$ and let W be an arbitrary configuration function. Then t has a weak local decomposition with respect to W if and only if G(W) is strongly connected. Furthermore, there is at most one weak operation of $\vee$.

Proof: By Lemma 5.4, we know an arbitrary matrix t can be written in the form

\[ t = d \times s, \]

where $s_{ii} = \phi$ for all i and d is a diagonal matrix. By Lemma 5.5, $s = l \vee u$ where l and u are lower and upper triangular matrices, respectively, with $l_{ii} = u_{ii} = \phi$ for all i. Thus,

\[ t = d \times ( l \vee u ). \]

Suppose that G(W) is strongly connected. By Theorem 5.15, l has a local decomposition $l = \mathop{\times}_{j=1}^{k} c(j)$, and by Theorem 5.16, u has a local decomposition $u = \mathop{\times}_{i=1}^{m} r(i)$. Hence, the expression

\[ d \times \Bigl[ \Bigl( \mathop{\times}_{j=1}^{k} c(j) \Bigr) \vee \Bigl( \mathop{\times}_{i=1}^{m} r(i) \Bigr) \Bigr] \]

is a weak decomposition of $d \times ( l \vee u )$, and, since each c(j) and r(i) is local with respect to W,

\[ t = d \times ( l \vee u ) = d \times \Bigl[ \Bigl( \mathop{\times}_{j=1}^{k} c(j) \Bigr) \vee \Bigl( \mathop{\times}_{i=1}^{m} r(i) \Bigr) \Bigr] \]

is a weak local decomposition of t.

Theorem 5.11 shows the sufficiency of the statement.

Q.E.D.
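The first step of the proof, writing $t = d \times (l \vee u)$, is easy to carry out explicitly. The following is a minimal sketch assuming the max-plus setting with $\phi = 0$; the function name `split` and the sample matrix are hypothetical, and integer-valued entries are used so that the floating-point check is exact.

```python
NEG = float("-inf")


def maxplus_matmul(a, b):
    n = len(a)
    return [[max(a[r][k] + b[k][c] for k in range(n)) for c in range(n)]
            for r in range(n)]


def join(a, b):
    """Entrywise maximum a v b."""
    return [[max(x, y) for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def split(t):
    """Return d (diagonal), l (lower), u (upper) with t = d x (l v u)."""
    n = len(t)
    d = [[t[r][r] if r == c else NEG for c in range(n)] for r in range(n)]
    # s has phi = 0 on its diagonal: each row of t is shifted by -t[r][r].
    s = [[t[r][c] - t[r][r] for c in range(n)] for r in range(n)]
    l = [[s[r][c] if c <= r else NEG for c in range(n)] for r in range(n)]
    u = [[s[r][c] if c >= r else NEG for c in range(n)] for r in range(n)]
    return d, l, u


t = [[1.0, 4.0, NEG],
     [2.0, 0.0, 3.0],
     [NEG, 5.0, 2.0]]       # finite diagonal, as the theorem requires
d, l, u = split(t)
rebuilt = maxplus_matmul(d, join(l, u))
```

The single $\vee$ in `join(l, u)` is the one weak operation referred to in the theorem; the remaining work is the local decomposition of l and u separately.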

As mentioned earlier, this is by no means the most efficient method of decomposition, and one would not implement the decomposition by following the constructive steps used to prove Theorem 5.10. The results and hypothesis of the theorem are analogous to conditions for existence of solutions of differential equations. Thus, although the results guarantee the existence of local decompositions, efficient methods for computing them must still be developed.


5.2. Decomposition of Templates

We now present, without proof, the results of the previous section in the context of the image algebra, using the isomorphism as defined in Chapter 2. As before, we let $V = \{1,\ldots,n\}$ and W be a configuration function on V. We assume that the point set X is finite and rectangular, of size h x k, hk = n, with the lexicographical ordering as set out in Chapter 2, and that $F_{-\infty}$ is a sub-bounded l-group. Everything in section 5.1 applies to templates under either the additive maximum $\boxed{\vee}$ or the multiplicative maximum ⓥ. We will state the results using the symbol $\boxed{\vee}$ with the understanding that in this section, $\boxed{\vee}$ may be replaced everywhere by ⓥ and + replaced everywhere by *, and the results will still be valid.

Let $a, b \in F^{X}$ be arbitrary. We define the outer product of a and b to be an element $t \in (F^{X})^{X}$, with gray values of

\[ t_y(x) = a(y) + b(x). \]

We denote the outer product of a and b by > a,b <.


Let $t \in (F_{-\infty}^{X})^{X}$. A template decomposition of t is a set of templates $t(1),\ldots,t(j)$ such that $t = t(1) \,\boxed{\vee}\, t(2) \,\boxed{\vee}\, \cdots \,\boxed{\vee}\, t(j)$. The t(i) are called the factors of the decomposition. We write $t = \mathop{\boxed{\vee}}_{i=1}^{j} t(i)$ is a decomposition of t. The decomposition is called weak if the operation $\vee$ replaces any operation $\boxed{\vee}$ in the decomposition. We say $t \in (F_{-\infty}^{X})^{X}$ is local with respect to W if $S_{-\infty}(t_{x_i}) \subset \{ x_j : j \in W(i) \}$ for all $x_i \in X$. A decomposition $\{t(i)\}_{i=1}^{j}$ of $t \in (F_{-\infty}^{X})^{X}$ is called a local decomposition of t with respect to W if t(i) is local with respect to W for all $i = 1,\ldots,j$.

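Lemma 5.17. Let $s \in (F_{-\infty}^{X})^{X}$, $t \in (F_{-\infty}^{X})^{X}$ be given. Then $(s \,\boxed{\vee}\, t)' = t' \,\boxed{\vee}\, s'$; the dual operations also satisfy this property.

Lemma 5.18. Suppose that $s \,\boxed{\vee}\, t = r$ is a decomposition of r. Then this decomposition is not unique, and we have $\hat{s} \,\boxed{\vee}\, \hat{t}$ is also a decomposition of r where

\[ \hat{s} = s \,\boxed{\vee}\, \lambda, \qquad \hat{t} = -\lambda \,\boxed{\vee}\, t \]

and $\lambda \in F$ is arbitrary, $\lambda \in (F_{-\infty}^{X})^{X}$.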

Lemma 5.19. Let $t \in (F_{-\infty}^{X})^{X}$ be such that $t_x(x) \in F$ $\forall\, x \in X$. Then t is equivalent to a template s which has the property that $s_x(x) = \phi$ $\forall\, x \in X$. In this case, we have

\[ s = d \,\boxed{\vee}\, t \]

where

\[ d = \operatorname{diag}\bigl( -t_{x_1}(x_1),\ -t_{x_2}(x_2),\ \ldots,\ -t_{x_n}(x_n) \bigr). \]

If $s \in (F_{-\infty}^{X})^{X}$ satisfies $s_x(x) = \phi$ for all $x \in X$, then we say that s is $\phi$-diagonal. A template $t \in (F_{-\infty}^{X})^{X}$ is said to be lower diagonal if $\Psi(t)$ is a lower diagonal matrix, and upper diagonal if $\Psi(t)$ is an upper diagonal matrix. If t is lower diagonal, then t satisfies

\[ t_{x_j}(x_i) = -\infty \quad \text{if } i < j \]

and if t is upper diagonal then

\[ t_{x_j}(x_i) = -\infty \quad \text{if } j < i . \]

Lemma 5.20. Let $t \in (F_{-\infty}^{X})^{X}$ be $\phi$-diagonal. Then t has a weak decomposition into lower and upper triangular templates. In particular, t can be written as

\[ t = l \vee u, \]

where l (u) is lower (upper) diagonal, and $l_{x_i}(x_i) = u_{x_i}(x_i) = \phi$. Here,

\[
l_{x_j}(x_i) =
\begin{cases}
t_{x_j}(x_i) & j \le i \\
-\infty & \text{otherwise}
\end{cases},
\qquad
u_{x_j}(x_i) =
\begin{cases}
t_{x_j}(x_i) & i \le j \\
-\infty & \text{otherwise}
\end{cases}.
\]

Corollary 5.21. Let $t \in (F_{-\infty}^{X})^{X}$ be lower or upper triangular with the property that $t_x(x) \in F$ $\forall\, x \in X$. Then t is equivalent to a template which is $\phi$-diagonal.

A template $t \in (F_{-\infty}^{X})^{X}$ is called an off template if $\Psi(t)$ is an off matrix. The off-entry value occurring at location (i,j) is the value $t_{x_j}(x_i)$.

Lemma 5.22. Let $l \in (F_{-\infty}^{X})^{X}$ be lower triangular and $\phi$-diagonal. Then

\[ l = {}^{1}r \,\boxed{\vee}\, {}^{2}r \,\boxed{\vee}\, \cdots \,\boxed{\vee}\, {}^{n-1}r , \]

where 

“r  =  1  V  |>1„,/.,<|, 

and  >  ,  <  denotes  the  outer  product. 

Note  that  s  =  >  1^^^,  l^^<  G  (F2^)^  and 




Note that each template ${}^{ik}r$ is an off template, with off-entry value occurring at location (i,k).

The  main  result  of  this  chapter  is 

Theorem 5.24. Let $t \in (F_{-\infty}^{X})^{X}$ be a doubly-F-astic template with $t_x(x) \in F$ for all $x \in X$, and W a configuration function. Then t has a weak local decomposition if and only if G(W) is strongly connected. Furthermore, there is at most one weak operation of $\vee$.

The sufficiency of Theorem 5.24 is shown next in Theorem 5.25.

Theorem 5.25. If every $t \in (F_{-\infty}^{X})^{X}$ has a local decomposition with respect to W, then G(W) is strongly connected.

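The exchange template $p^{x_i x_j} \in (F_{-\infty}^{X})^{X}$ associated with (i,j) is the template defined by

\[
p^{x_i x_j}_{z}(u) =
\begin{cases}
\phi & \text{if } z = u \text{ and } z \ne x_i,\ z \ne x_j, \\
\phi & \text{if } (u,z) = (x_i,x_j) \text{ or } (u,z) = (x_j,x_i), \\
-\infty & \text{otherwise.}
\end{cases}
\]

The template corresponds to the transposition $\sigma = (i,j) \in S_n$ and to the exchange matrix $p^{ij}$. Note that $a \,\boxed{\vee}\, p^{x_i x_j} = b$, where

\[
b(z) =
\begin{cases}
a(x_i) & \text{if } z = x_j, \\
a(x_j) & \text{if } z = x_i, \\
a(z) & \text{otherwise.}
\end{cases}
\]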

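Lemma 5.26. Let $x_i, x_j \in X$, $i \ne j$, and let W be a configuration function. Suppose there exists an i-j path in G(W),

\[ i = k_0,\ k_1,\ \ldots,\ k_m = j . \]

Then the exchange template $p^{x_i x_j}$ can be written as

\[ p^{x_i x_j} = p^{x_{k_0} x_{k_1}} \,\boxed{\vee}\, p^{x_{k_1} x_{k_2}} \,\boxed{\vee}\, \cdots \,\boxed{\vee}\, p^{x_{k_{m-1}} x_{k_m}} \,\boxed{\vee}\, p^{x_{k_{m-2}} x_{k_{m-1}}} \,\boxed{\vee}\, \cdots \,\boxed{\vee}\, p^{x_{k_1} x_{k_2}} \,\boxed{\vee}\, p^{x_{k_0} x_{k_1}} . \]

Lemma 5.27. Let $x_i, x_j \in X$, $i \ne j$, and let W be a configuration function. Let $p^{x_i x_j}$ be the exchange template associated with i,j. Assume there exists an i-j path

\[ i = k_0,\ k_1,\ \ldots,\ k_m = j . \]

Then the exchange template $p^{x_i x_j}$ has a local decomposition with respect to W.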

We  now  present  a  local  decomposition  method  for  an  off  template. 

Theorem 5.28. Let $s \in (F_{-\infty}^{X})^{X}$ be a lower triangular, $\phi$-diagonal template with off-entry value of $\beta$ at location (i,j):

\[
s_{x_k}(x_m) =
\begin{cases}
\phi & k = m, \\
\beta & (k,m) = (j,i), \\
-\infty & \text{otherwise.}
\end{cases}
\]

Let W be a configuration function such that G(W) is strongly connected. Then s is equivalent to a template which is local with respect to W. Furthermore, s has a local decomposition with respect to W.

Theorem 5.29. Let $l \in (F_{-\infty}^{X})^{X}$ be a lower triangular template with $l_x(x) \in F$ for all $x \in X$, and W a configuration function such that G(W) is strongly connected. Then l has a local decomposition with respect to W.

Using the property of transpose as stated in Lemma 5.17, we can prove results 5.19, 5.21, 5.22, 5.23, 5.28 and 5.29 for upper triangular templates.

Theorem 5.30. Let $u \in (F_{-\infty}^{X})^{X}$ be an upper triangular template with $u_{x_i}(x_i) \in F$ for all i, and W a configuration function such that G(W) is strongly connected. Then u has a local decomposition with respect to W.

The  main  theorem  follows  immediately. 

Theorem 5.24. Let $t \in (F_{-\infty}^{X})^{X}$ be a doubly-F-astic template with $t_x(x) \in F$ for all $x \in X$, and W a configuration function. Then t has a weak local decomposition if and only if G(W) is strongly connected. Furthermore, there is at most one weak operation of $\vee$.

The use of this theorem is made clear by the following discussion. Suppose a lattice transform t is to be mapped to a parallel architecture through a decomposition technique, whereby

\[ t = \Bigl( \mathop{\boxed{\vee}}_{i=1}^{h} r_i \Bigr) \vee \Bigl( \mathop{\boxed{\vee}}_{j=1}^{k} s_j \Bigr). \]

Applying the transform t to an image a, we have

\[
\begin{aligned}
a \,\boxed{\vee}\, t &= a \,\boxed{\vee}\, \Bigl[ \Bigl( \mathop{\boxed{\vee}}_{i=1}^{h} r_i \Bigr) \vee \Bigl( \mathop{\boxed{\vee}}_{j=1}^{k} s_j \Bigr) \Bigr] \\
&= \Bigl[ a \,\boxed{\vee}\, \Bigl( \mathop{\boxed{\vee}}_{i=1}^{h} r_i \Bigr) \Bigr] \vee \Bigl[ a \,\boxed{\vee}\, \Bigl( \mathop{\boxed{\vee}}_{j=1}^{k} s_j \Bigr) \Bigr] \\
&= \bigl[ ( \cdots (( a \,\boxed{\vee}\, r_1 ) \,\boxed{\vee}\, r_2 ) \cdots ) \,\boxed{\vee}\, r_h \bigr] \vee \bigl[ ( \cdots (( a \,\boxed{\vee}\, s_1 ) \,\boxed{\vee}\, s_2 ) \cdots ) \,\boxed{\vee}\, s_k \bigr].
\end{aligned}
\]

Every $r_i$, $s_j$ is local with respect to the network of processors and hence directly implementable on the parallel architecture. The one operation of maximum is related to storage. When computing the transform $a \,\boxed{\vee}\, t$, the image $[ ( \cdots (( a \,\boxed{\vee}\, r_1 ) \,\boxed{\vee}\, r_2 ) \cdots ) \,\boxed{\vee}\, r_h ]$ must be computed, the result stored, the image $[ ( \cdots (( a \,\boxed{\vee}\, s_1 ) \,\boxed{\vee}\, s_2 ) \cdots ) \,\boxed{\vee}\, s_k ]$ computed, and then the maximum between the two results taken. Most processing elements in parallel architectures have a small amount of local memory available. If they do not, then the host machine, or another computer which is able to communicate with the parallel one, must store the results separately and then return both to the parallel machine to compute the last pointwise maximum. Thus the one operation of maximum is not an unreasonable stipulation in a decomposition.


5.3. Applications to Rectangular Templates

A special class of templates within the invariant ones are rectangular templates. Let $X \subset \mathbb{Z}^2$ be an m x n array. Then a rectangular template $t \in (F_{-\infty}^{X})^{X}$ has a rectangular array for its support (away from the boundary of X). Specifically, $S_{-\infty}(t_y)$ is a rectangle, for some $y \in X$ such that $S_{-\infty}(t_y) \cap \partial X = \emptyset$, where $\partial X = \{ (i,j) : i = 0 \text{ or } m-1, \text{ or } j = 0 \text{ or } n-1 \}$. We will use the fact that invariant templates correspond to matrices that are block toeplitz with toeplitz blocks [33]. First, we derive the results in matrix notation, and use the isomorphism to map the theorems to image algebra notation.

We give conditions under which rectangular templates can be decomposed into the product of two strip templates, one vertical and one horizontal. Since the proof of this theorem is constructive, a decomposition is given. This theorem was presented in its original form by Li [61].


5.3.1. Decomposition of block toeplitz matrices

This section presents results on a decomposition technique for block toeplitz matrices with toeplitz blocks. Matrices that are block toeplitz with toeplitz blocks correspond to invariant templates. Conditions are given to guarantee a decomposition of such matrices into a product of two matrices, each of which corresponds to a strip template in the image algebra. Strip templates are templates whose supports are a 1 x k array (horizontal), or a k x 1 array (vertical). Even though these decompositions may not be local ones, a decomposition of this type will reduce the number of computations per pixel necessary to compute the transform, on either a parallel or sequential machine. As an example, suppose $r \in (F_{-\infty}^{X})^{X}$ is a rectangular template and has an h x k support. Suppose there exist two templates t and s such that $r = s \,\boxed{\vee}\, t$, where t is a rectangular template with an h x 1 support and s is a rectangular template with a 1 x k support. Then for $a \in F^{X}$ and using the associative property of the $\boxed{\vee}$ operation, we have

\[ a \,\boxed{\vee}\, r = a \,\boxed{\vee}\, ( s \,\boxed{\vee}\, t ) = ( a \,\boxed{\vee}\, s ) \,\boxed{\vee}\, t . \]

Computing $a \,\boxed{\vee}\, r$ directly involves finding the maximum of hk additions, while computing $( a \,\boxed{\vee}\, s ) \,\boxed{\vee}\, t$ involves finding the maximum of h + k additions. Thus savings in computation can be realized on sequential as well as parallel processors.
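The saving from hk to h + k additions per pixel can be seen in a small experiment. The sketch below is an illustration only: it assumes translation-invariant templates given as dictionaries from offsets to weights, treats points outside the image as carrying the value $-\infty$, and uses hypothetical weights and a hypothetical 3 x 4 image. The function name `dilate` is not taken from the text.

```python
NEG = float("-inf")


def dilate(image, weights):
    """Additive maximum: out[y] = max over offsets d of image[y + d] + weights[d]."""
    rows, cols = len(image), len(image[0])
    out = [[NEG] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            for (di, dj), w in weights.items():
                ii, jj = i + di, j + dj
                if 0 <= ii < rows and 0 <= jj < cols:   # skip points outside X
                    out[i][j] = max(out[i][j], image[ii][jj] + w)
    return out


# Hypothetical strip templates: t is h x 1 (vertical), s is 1 x k (horizontal).
vertical = {(-1, 0): 1, (0, 0): 3, (1, 0): 2}       # h = 3
horizontal = {(0, -1): 0, (0, 0): 5, (0, 1): 4}     # k = 3
# The rectangular template r = s composed with t has weight v + w at (di, dj).
rect = {(di, dj): v + w
        for (di, _), v in vertical.items()
        for (_, dj), w in horizontal.items()}

image = [[1, 7, 2, 0],
         [3, 0, 8, 5],
         [6, 4, 1, 9]]

direct = dilate(image, rect)                            # h*k = 9 additions per pixel
two_pass = dilate(dilate(image, horizontal), vertical)  # h + k = 6 additions per pixel
```

Both computations give the same image; the two-pass form simply regroups the additions using associativity.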

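Minimax matrix results. We shall first prove the following lemma.

Lemma 5.31. Suppose that $s \in \mathcal{M}_{nn}$ is a diagonal matrix of form $s = \operatorname{diag}(\alpha,\ldots,\alpha)$, where $\alpha \in F$. Then for $t \in \mathcal{M}_{nn}$, $s \times t = t \times s$.

Proof: Let $r = s \times t$, and $w = t \times s$. Then

\[ r_{ij} = \bigvee_{k=1}^{n} s_{ik} \times t_{kj} = s_{ii} \times t_{ij} \]

and

\[ w_{ij} = \bigvee_{k=1}^{n} t_{ik} \times s_{kj} = t_{ij} \times s_{jj} . \]

Since $s_{ii} = s_{jj} = \alpha$ $\forall\, i,j = 1,\ldots,n$, we have

\[ r_{ij} = s_{ii} \times t_{ij} = s_{jj} \times t_{ij} = t_{ij} \times s_{jj} = w_{ij} . \]

Q.E.D.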


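Lemma 5.32. Suppose $s,t \in \mathcal{M}_{nn}$ are each block toeplitz with toeplitz blocks. Denote s and t by

\[
s = \begin{pmatrix} \sigma^{11} & \cdots & \sigma^{1h} \\ \vdots & & \vdots \\ \sigma^{h1} & \cdots & \sigma^{hh} \end{pmatrix},
\qquad
t = \begin{pmatrix} \tau^{11} & \cdots & \tau^{1h} \\ \vdots & & \vdots \\ \tau^{h1} & \cdots & \tau^{hh} \end{pmatrix}.
\]

Suppose s and t satisfy the following conditions:

(1) n = hk, and each block $\sigma^{ij}$, $\tau^{ij}$ is a k x k matrix;

(2) there exist 2h-1 constants $\lambda_{ij}$ such that $\tau^{ij} = \operatorname{diag}(\lambda_{ij},\ldots,\lambda_{ij})$ and the matrix $\Lambda = (\lambda_{ij}) \in \mathcal{M}_{hh}$ is toeplitz;

(3) $\sigma^{ij} = \begin{cases} \Phi_k & i \ne j \\ a \in \mathcal{M}_{kk} & i = j \end{cases}$, where $\Phi_k$ denotes the null matrix of size k x k, and a is a toeplitz matrix where $a_{ii} \in F$.

If (1) - (3) are satisfied, then $s \times t = t \times s$ and is of form

\[
\begin{pmatrix}
\sigma^{11} \times \tau^{11} & \sigma^{11} \times \tau^{12} & \cdots & \sigma^{11} \times \tau^{1h} \\
\sigma^{22} \times \tau^{21} & \sigma^{22} \times \tau^{22} & \cdots & \sigma^{22} \times \tau^{2h} \\
\vdots & & & \vdots \\
\sigma^{hh} \times \tau^{h1} & \cdots & & \sigma^{hh} \times \tau^{hh}
\end{pmatrix}.
\]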

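Proof: Let $u = s \times t$, and denote the matrix $u = (u_{ij})$ in block notation by

\[
u = \begin{pmatrix} \mu^{11} & \cdots & \mu^{1h} \\ \vdots & & \vdots \\ \mu^{h1} & \cdots & \mu^{hh} \end{pmatrix}.
\]

Then

\[ u_{ij} = \bigvee_{m=1}^{n} s_{im} + t_{mj} . \]

For $i,k \in \mathbb{Z}^{+}$ and using the division algorithm for integers, we have

\[ i = k \cdot p_i + r_i \quad \text{for some } 0 \le p_i \le h \text{ and } 0 \le r_i \le k-1 . \]

If $r_i = 0$ then write $i = k \cdot (p_i - 1) + k$, where $0 \le p_i - 1 \le h-1$. Thus, (abusing notation) in this case, we can always write

\[ i = k \cdot p_i + r_i \quad \text{for some } 0 \le p_i \le h-1, \text{ and } 0 < r_i \le k . \]

Similarly, we have

\[ j = k \cdot p_j + r_j \quad \text{for some } 0 \le p_j \le h-1, \text{ and } 0 < r_j \le k . \]

The element $u_{ij}$ lies in the $(p_i+1,\ p_j+1)$-th block of u, and is the $(r_i, r_j)$-th entry in that block. The matrix s has values of $-\infty$ when m indexes outside the block $\sigma^{p_i+1,p_i+1}$, i.e.,

\[ s_{im} = -\infty \quad \text{for values of m not in } \{ w : i - r_i + 1 \le w \le i - r_i + k \}. \]

Thus, $u_{ij} = \bigvee_{m=i-r_i+1}^{i-r_i+k} s_{im} + t_{mj}$. Since the values $s_{im}$, $i - r_i + 1 \le m \le i - r_i + k$, lie on the $r_i$-th row of block $\sigma^{p_i+1,p_i+1}$ and the values $t_{mj}$ lie on the $r_j$-th column of block $\tau^{p_i+1,p_j+1}$, the value $u_{ij}$ can be represented by

\[ u_{ij} = \bigl( \sigma^{p_i+1,p_i+1} \times \tau^{p_i+1,p_j+1} \bigr)_{r_i r_j} . \]

This is true for any value of $u_{ij}$ such that $u_{ij}$ lies in block $\mu^{p_i+1,p_j+1}$, i.e., $1 \le r_i, r_j \le k$. Thus we have

\[ \sigma^{p_i+1,p_i+1} \times \tau^{p_i+1,p_j+1} = \mu^{p_i+1,p_j+1} . \]

Since i,j were arbitrary, this relation holds for each value $1 \le p_i+1,\ p_j+1 \le h$. That is,

\[ \sigma^{ii} \times \tau^{ij} = \mu^{ij}, \quad \forall\, i,j = 1,\ldots,h . \]

To show commutativity of the product, we note that for all i,j, $\sigma^{ii}$ is toeplitz with finite elements on the diagonal, and the block $\tau^{ij}$ is a diagonal matrix with a constant value on its diagonal. Thus, we have by Lemma 5.31, for all i,j,

\[ \sigma^{ii} \times \tau^{ij} = \tau^{ij} \times \sigma^{ii} . \]

Let $w = t \times s$, $w = (\omega^{ij})$ in block notation, $i,j = 1,\ldots,h$. We show that $\omega^{ij} = \tau^{ij} \times \sigma^{jj}$. Here, $w_{ij} = \bigvee_{m=1}^{n} t_{im} + s_{mj}$. As before and using the same method and notation, we have the same restrictions on the indices of $s_{mj}$:

\[ s_{mj} = -\infty \quad \text{for values of m not in } \{ w : j - r_j + 1 \le w \le j - r_j + k \}, \]

and the values $s_{mj}$ for $j - r_j + 1 \le m \le j - r_j + k$ lie on the $r_j$-th column of the block $\sigma^{p_j+1,p_j+1}$. The values $t_{im}$ for $j - r_j + 1 \le m \le j - r_j + k$ lie on the $r_i$-th row of block $\tau^{p_i+1,p_j+1}$, i.e.,

\[ w_{ij} = \bigl( \tau^{p_i+1,p_j+1} \times \sigma^{p_j+1,p_j+1} \bigr)_{r_i r_j} , \]

and thus in general we have

\[ \tau^{ij} \times \sigma^{jj} = \omega^{ij} . \]

Since $\sigma^{ii} = \sigma^{jj}$ for all i,j, we have

\[ \omega^{ij} = \tau^{ij} \times \sigma^{jj} = \tau^{ij} \times \sigma^{ii} = \sigma^{ii} \times \tau^{ij} = \mu^{ij} . \]

Q.E.D.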


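Let us define the strip matrices, which correspond to strip templates. A vertical strip matrix $t \in \mathcal{M}_{nn}$, n = hk, is a matrix which is block toeplitz with toeplitz blocks, $t = (\tau^{ij})$, each $\tau^{ij}$ is k x k, $i,j = 1,\ldots,h$, and $\tau^{ij} = \operatorname{diag}(\alpha_{ij},\ldots,\alpha_{ij})$ for some $\alpha_{ij} \in F_{-\infty}$. Since $\tau^{ij} = \tau^{1,j-i+1}$ if $j > i$ and $\tau^{ij} = \tau^{i-j+1,1}$ if $i > j$, there are actually only 2h-1 constants $\alpha_{ij}$ which determine the $\tau^{ij}$. Denote these constants by $\alpha_{11}$, $\alpha_{1j}$, $\alpha_{i1}$, $i,j = 2,\ldots,h$, where

\[
\tau^{ij} =
\begin{cases}
\operatorname{diag}(\alpha_{11},\ldots,\alpha_{11}) & \text{if } i = j, \\
\operatorname{diag}(\alpha_{1,j-i+1},\ldots,\alpha_{1,j-i+1}) & \text{if } j > i, \\
\operatorname{diag}(\alpha_{i-j+1,1},\ldots,\alpha_{i-j+1,1}) & \text{if } i > j .
\end{cases}
\]

Also, we must have $\alpha_{1j} \in F$ for $j = 1,2,\ldots,m_1$ and $\alpha_{j1} \in F$ for $j = 1,2,\ldots,m_2$, for some $1 \le m_1, m_2 \le h$. Vertical strip matrices correspond to vertical strip templates.

A horizontal strip matrix $s \in \mathcal{M}_{nn}$, n = hk, is a matrix which is block toeplitz with toeplitz blocks, $s = (\sigma^{ij})$, each $\sigma^{ij}$ is k x k, $i,j = 1,\ldots,h$, and

\[
\sigma^{ij} =
\begin{cases}
\Phi_k & i \ne j, \\
a \in \mathcal{M}_{kk} & i = j,
\end{cases}
\]

where $\Phi_k$ denotes the null matrix of size k x k, and a is a toeplitz matrix where $a_{ii} \in F$. Horizontal strip matrices correspond to horizontal strip templates. Note that such a matrix is a diagonal block toeplitz matrix with toeplitz blocks, that is, the only non-null block is the diagonal block.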

We  are  now  in  a  position  to  prove  the  main  result  of  this  section. 

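Theorem 5.33. Assume $u \in \mathcal{M}_{nn}$, n = hk, is block toeplitz with toeplitz blocks, and write

\[
u = \begin{pmatrix} \mu^{11} & \cdots & \mu^{1h} \\ \vdots & & \vdots \\ \mu^{h1} & \cdots & \mu^{hh} \end{pmatrix},
\]

where each $\mu^{ij}$ is a k x k submatrix and $\mu^{ii}_{mm} \ne -\infty$ $\forall\, i,m$. Then u is decomposable into two strip matrices, one horizontal s and one vertical t, if and only if

(1) there exist 2h-1 constants $\lambda_{11}$, $\lambda_{1j}$, $\lambda_{i1}$, $i,j = 2,\ldots,h$, with $\lambda_{11} = 0$, such that $\Lambda = (\lambda_{ij}) \in \mathcal{M}_{hh}$ is toeplitz and of form

\[
\Lambda = \begin{pmatrix}
0 & \lambda_{12} & \cdots & \lambda_{1h} \\
\lambda_{21} & 0 & \ddots & \vdots \\
\vdots & \ddots & \ddots & \lambda_{12} \\
\lambda_{h1} & \cdots & \lambda_{21} & 0
\end{pmatrix};
\]

(2) the constants satisfy $\lambda_{ij} \times \mu^{ii} = \mu^{ij}$ for all $i,j = 1,\ldots,h$.

In this case a decomposition is given by the following matrices which are each block toeplitz with toeplitz blocks:

\[
s = \begin{pmatrix} \sigma^{11} & \cdots & \sigma^{1h} \\ \vdots & & \vdots \\ \sigma^{h1} & \cdots & \sigma^{hh} \end{pmatrix},
\qquad
t = \begin{pmatrix} \tau^{11} & \cdots & \tau^{1h} \\ \vdots & & \vdots \\ \tau^{h1} & \cdots & \tau^{hh} \end{pmatrix},
\]

where each of $\sigma^{ij}$, $\tau^{ij}$ are k x k matrices and satisfy:

\[
\tau^{ij} = \operatorname{diag}(\lambda_{ij},\ldots,\lambda_{ij});
\qquad
\sigma^{ij} =
\begin{cases}
\Phi_k & i \ne j, \\
\mu^{11} & i = j .
\end{cases}
\]

Here, $\lambda_{11} = 0$, and $\lambda_{1j} = u_{1j} \times (u_{11})^{-1}$, $\lambda_{i1} = u_{i1} \times (u_{11})^{-1}$, $i,j = 2,\ldots,h$.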


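Proof: Suppose (1) and (2) are satisfied. We show that for s and t as in the statement of the theorem, $s \times t = u$. Let $w = s \times t$. By Lemma 5.32, w has form

\[
w = \begin{pmatrix}
\sigma^{11} \times \tau^{11} & \cdots & \sigma^{11} \times \tau^{1h} \\
\sigma^{22} \times \tau^{21} & \cdots & \sigma^{22} \times \tau^{2h} \\
\vdots & & \vdots \\
\sigma^{hh} \times \tau^{h1} & \cdots & \sigma^{hh} \times \tau^{hh}
\end{pmatrix}.
\]

Choose an element $w_{ij}$. As in Lemma 5.32,

\[ i = k \cdot p_i + r_i \quad \text{for some } 0 \le p_i \le h-1, \text{ and } 0 < r_i \le k , \]
\[ j = k \cdot p_j + r_j \quad \text{for some } 0 \le p_j \le h-1, \text{ and } 0 < r_j \le k , \]

and the pair (i,j) lies in the block $(p_i+1,\ p_j+1)$. Thus,

\[ w_{ij} = (s \times t)_{ij} = \bigvee_{m=1}^{n} s_{im} + t_{mj} = \bigvee_{m=i-r_i+1}^{i-r_i+k} s_{im} + t_{mj} . \]

Since $\tau^{p_i+1,p_j+1} = \operatorname{diag}(\lambda_{p_i+1,p_j+1},\ldots,\lambda_{p_i+1,p_j+1})$, the only possible non-null element of $\{ t_{mj} : i - r_i + 1 \le m \le i - r_i + k \}$ lies on the diagonal of $\tau^{p_i+1,p_j+1}$ and, in fact, lies in the $(r_j, r_j)$-th position in $\tau^{p_i+1,p_j+1}$. The $(r_i, r_j)$-th position in $\mu^{p_i+1,p_i+1}$ is in the $(i,\ i - r_i + r_j)$-th position in u. So

\[ w_{ij} = u_{i,\,i-r_i+r_j} \times \lambda_{p_i+1,p_j+1} . \]

By (2) we know that

\[ \lambda_{p_i+1,p_j+1} \times \mu^{p_i+1,p_i+1} = \mu^{p_i+1,p_j+1} , \]

which implies that

\[ \lambda_{p_i+1,p_j+1} \times \bigl( \mu^{p_i+1,p_i+1} \bigr)_{r_i r_j} = \bigl( \mu^{p_i+1,p_j+1} \bigr)_{r_i r_j} , \]

which implies

\[ \lambda_{p_i+1,p_j+1} \times u_{i,\,i-r_i+r_j} = u_{ij} . \]

Therefore,

\[ w_{ij} = u_{i,\,i-r_i+r_j} \times \lambda_{p_i+1,p_j+1} = u_{ij} . \]

Now suppose that u is decomposable into two strip matrices, a horizontal one s and a vertical one t. We show that s and t satisfy conditions (1) and (2). By the definition of a vertical strip matrix, we see that t is block toeplitz with toeplitz blocks, and $t = (\tau^{ij})$ is of form

\[
\tau^{ij} =
\begin{cases}
\operatorname{diag}(\alpha_{11},\ldots,\alpha_{11}) & \text{if } i = j, \\
\operatorname{diag}(\alpha_{1,j-i+1},\ldots,\alpha_{1,j-i+1}) & \text{if } j > i, \\
\operatorname{diag}(\alpha_{i-j+1,1},\ldots,\alpha_{i-j+1,1}) & \text{if } i > j,
\end{cases}
\]

for some 2h-1 constants $\alpha_{11}$, $\alpha_{1j}$, $\alpha_{i1}$, $i,j = 2,\ldots,h$. Set $\alpha_{11} = 0$, $\alpha_{1j} = u_{1j} \times (u_{11})^{-1}$, $\alpha_{i1} = u_{i1} \times (u_{11})^{-1}$. Define $\Lambda = (\lambda_{ij}) \in \mathcal{M}_{hh}$ by

\[
\lambda_{ij} =
\begin{cases}
\alpha_{11} & \text{if } i = j, \\
\alpha_{i-j+1,1} & \text{if } j < i, \\
\alpha_{1,j-i+1} & \text{if } i < j .
\end{cases}
\]

Then $\Lambda$ is toeplitz, with the 2h-1 constants $\lambda_{11}$, $\lambda_{1j}$, $\lambda_{i1}$, $i,j = 2,\ldots,h$, satisfying condition (1).

Also, s is a horizontal strip matrix, which means that (using block notation) $s = (\sigma^{ij})$, where

\[
\sigma^{ij} =
\begin{cases}
\Phi_k & i \ne j, \\
a \in \mathcal{M}_{kk} & i = j,
\end{cases}
\]

and a is toeplitz with non-null diagonal entries. Set $a = \mu^{11}$. The last thing we need to show is that condition (2) is satisfied. By Lemma 5.32, the (i,j)-th block of $s \times t$ is of form

\[ \sigma^{ii} \times \tau^{ij} , \]

and since $\tau^{ij}$ is diagonal, $\sigma^{ii} \times \tau^{ij} = \tau^{ij} \times \sigma^{ii}$. So, $\tau^{ij} \times \sigma^{ii} = \operatorname{diag}(\alpha_{ij},\ldots,\alpha_{ij}) \times \mu^{ii} = \lambda_{ij} \times \mu^{ii} = \mu^{ij}$. Obviously, $s \times t = u$, and this completes the proof.

Q.E.D.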


The statement in the image algebra of Theorem 5.33 is

Theorem 5.34 [61]. Let $r \in (\mathbb{R}_{-\infty}^{X})^{X}$ be a rectangular template with non-null weights $r_{x_i}(x_j)$, $i = 1,\ldots,m$, $j = 1,\ldots,k$. Then r has a decomposition into two strip templates, one horizontal and one vertical, if and only if for all $1 \le i,i' \le m$ and $1 \le j,j' \le k$,

\[ r_{x_i}(x_j) - r_{x_{i'}}(x_j) = r_{x_i}(x_{j'}) - r_{x_{i'}}(x_{j'}) . \]
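The condition of Theorem 5.34 can be tested directly on an array of weights. The sketch below is an illustration under stated assumptions: the weights of the rectangular template are stored as an m x k list of lists `r[i][j]`, integer values are used to keep comparisons exact, and the function names are hypothetical. It is enough to compare every row against row 0 and every column against column 0, since $r_{ij} - r_{0j} = r_{i0} - r_{00}$ for all i,j already implies the full condition.

```python
def is_separable(r):
    """Check r[i][j] - r[0][j] == r[i][0] - r[0][0] for all i, j."""
    return all(r[i][j] - r[0][j] == r[i][0] - r[0][0]
               for i in range(len(r)) for j in range(len(r[0])))


def strip_weights(r):
    """Vertical and horizontal strip weights with r[i][j] = v[i] + h[j]."""
    vertical = [row[0] for row in r]
    horizontal = [r[0][j] - r[0][0] for j in range(len(r[0]))]
    return vertical, horizontal


r = [[5, 7, 6],
     [3, 5, 4],
     [8, 10, 9]]          # hypothetical weights; rows differ by constants
not_sep = [[0, 1],
           [1, 3]]        # hypothetical weights violating the condition

v, h = strip_weights(r)
rebuilt = [[vi + hj for hj in h] for vi in v]
```

The choice of normalization (here the horizontal strip has weight 0 in its first position) is not unique, in line with Lemma 5.18.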


CHAPTER 6

THE DIVISION ALGORITHM

6.1. A Division Algorithm in a Non-Euclidean Domain

The integers have the property that a division algorithm can be defined on them. For $a,b \in \mathbb{Z}$ with $b \ne 0$, there exist unique integers q,r such that a = qb + r where $0 \le r < |b|$. This is an example of an integral domain upon which is defined a Euclidean valuation [61]. In this section we present a division algorithm for the minimax algebra structure, and give an application of this result to image processing in the image algebra notation.

We  remark  that  the  boolean  case  has  already  been  stated  by  P.  Miller  [62],  and  will  be 
discussed  in  more  detail  at  the  end  of  this  section. 

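6.1.1. A Matrix Division Algorithm

Let $F_{-\infty}$ be a sub-bounded l-group of $\mathbb{R}_{-\infty}$. For notational convenience, we will write $t \in \mathcal{M}_{nn}(-\infty)$ when we mean that the matrix t will assume values only on $F \cup \{-\infty\}$. Similarly, we write $t \in \mathcal{M}_{nn}(+\infty)$ when the matrix t assumes values only on $F \cup \{+\infty\}$. We will show that for a finite vector $a \in F^n$ and a subset of matrices of $\mathcal{M}_{nn}(-\infty)$ there exist vectors q and $r \in F_{-\infty}^{n}$ such that

\[ a = ( t' \times q ) \vee r . \]

Lemma 6.1. Let $a \in F^n$ be finite, and $t \in \mathcal{M}_{nn}(-\infty)$ satisfy $S_{-\infty}(t_i) \ne \emptyset$ $\forall\, i = 1,\ldots,n$. Define $\tilde{t}$ by $\tilde{t} = (t^{*})'$. Then both $\tilde{t} \times' a$ and $t' \times (\tilde{t} \times' a)$ are finite, and

\[ t' \times (\tilde{t} \times' a) \le a . \]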


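Proof: First we note that

\[
\tilde{t}_{ij} = [(t^{*})']_{ij} = (t^{*})_{ji} = (t_{ij})^{*} =
\begin{cases}
-t_{ij} & \text{if } t_{ij} \in F, \\
+\infty & \text{if } t_{ij} = -\infty,
\end{cases}
\]

and that $S_{+\infty}(\tilde{t}_i) = S_{-\infty}(t_i)$. Let $b = \tilde{t} \times' a$ and let $c = t' \times b$. At location i, $b_i = \bigwedge_{j \in S_{+\infty}(\tilde{t}_i)} \tilde{t}_{ij} + a_j$. The vector b is finite, as for $i \in \{1,2,\ldots,n\}$, there exists at least one $j \in S_{+\infty}(\tilde{t}_i)$ such that $\tilde{t}_{ij} + a_j$ is finite, since by hypothesis, $S_{-\infty}(t_i) \ne \emptyset$ $\forall\, i = 1,\ldots,n$, and $a_i \in F$ $\forall\, i = 1,\ldots,n$ also. Thus, $b_i = \bigwedge_{j \in S_{+\infty}(\tilde{t}_i)} \tilde{t}_{ij} + a_j \in F$ $\forall\, i$. At location i, $c_i = \bigvee_{j \in S_{-\infty}(t'_i)} t'_{ij} + b_j$. By a similar argument, we see that $c_i \in F$ $\forall\, i$.

Suppose that $c_i = t'_{ik} + b_k$ for some $k \in \{1,2,\ldots,n\}$. Then $b_k = \bigwedge_{j \in S_{+\infty}(\tilde{t}_k)} \tilde{t}_{kj} + a_j = \tilde{t}_{kp} + a_p$ for some p. Since $k \in S_{-\infty}(t'_i)$, we know that $i \in S_{-\infty}(t_k)$. Since $S_{-\infty}(t_k) = S_{+\infty}(\tilde{t}_k)$, $i \in S_{+\infty}(\tilde{t}_k)$, hence

\[ \tilde{t}_{kp} + a_p \le \tilde{t}_{ki} + a_i \in F , \]

by our choice of p and the fact that $\tilde{t}_{ki} \in F$ and $a_i \in F$ $\forall\, i$. Thus

\[ c_i = t'_{ik} + b_k = t'_{ik} + \tilde{t}_{kp} + a_p \le t'_{ik} + \tilde{t}_{ki} + a_i = t_{ki} + ( -t_{ki} ) + a_i = a_i . \]

Thus, $c_i \le a_i$, and our lemma is proved.

Q.E.D.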


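We now state the Division Algorithm.

Theorem 6.2. The Division Algorithm. Let a, t satisfy the hypothesis of Lemma 6.1. Then for $q = \tilde{t} \times' a$, and r defined by

\[
r_i =
\begin{cases}
a_i & \text{if } a_i > [\, t' \times (\tilde{t} \times' a) \,]_i , \\
-\infty & \text{if } a_i = [\, t' \times (\tilde{t} \times' a) \,]_i ,
\end{cases}
\]

we have

\[ a = ( t' \times q ) \vee r . \]

Proof: By Lemma 6.1, $a \ge t' \times q = t' \times (\tilde{t} \times' a)$, and, by the definition of r,

\[ a \ge r . \]

Thus, $[\, t' \times (\tilde{t} \times' a) \,] \vee r \le a$. To show that equality holds, that is, that $[\, t' \times (\tilde{t} \times' a) \,] \vee r = a$, we examine two cases.

Case 1. $a_i > [\, t' \times (\tilde{t} \times' a) \,]_i$. Here, $[\, t' \times (\tilde{t} \times' a) \,]_i \vee r_i = [\, t' \times (\tilde{t} \times' a) \,]_i \vee a_i = a_i$.

Case 2. $a_i = [\, t' \times (\tilde{t} \times' a) \,]_i$. Here, $[\, t' \times (\tilde{t} \times' a) \,]_i \vee r_i = a_i \vee r_i = a_i \vee -\infty = a_i$.

Q.E.D.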
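The quotient and remainder of Theorem 6.2 can be computed in a few lines. The following is a minimal sketch, assuming F is the real numbers under addition; the function names (`tilde`, `minplus_matvec`, `maxplus_matvec`, `divide`) and the sample matrix and vector are hypothetical. The matrix is chosen so that every row and every column has a finite entry.

```python
NEG, POS = float("-inf"), float("inf")


def tilde(t):
    """Entrywise conjugate: -t_ij where finite, +infinity where t_ij is null."""
    return [[-v if v != NEG else POS for v in row] for row in t]


def transpose(t):
    return [list(col) for col in zip(*t)]


def minplus_matvec(m, a):
    """Dual product: (m x' a)_i = min over j of m[i][j] + a[j]."""
    return [min(row[j] + a[j] for j in range(len(a))) for row in m]


def maxplus_matvec(m, a):
    """(m x a)_i = max over j of m[i][j] + a[j]."""
    return [max(row[j] + a[j] for j in range(len(a))) for row in m]


def divide(t, a):
    """Return (q, r) with a = (t' x q) v r, following Theorem 6.2."""
    q = minplus_matvec(tilde(t), a)
    opened = maxplus_matvec(transpose(t), q)      # t' x q, which is <= a
    r = [ai if ai > oi else NEG for ai, oi in zip(a, opened)]
    return q, r


t = [[0, 1, NEG],
     [NEG, 0, 2],
     [1, NEG, 0]]
a = [5, 3, 7]
q, r = divide(t, a)
recomposed = [max(x, y) for x, y in zip(maxplus_matvec(transpose(t), q), r)]
```

In this example the remainder is null except at the last location, where $t' \times q$ falls strictly below a.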

Now  suppose  we  have  a  =  ( t'  X  q )  V  r  for  a  finite,  t  6  Mnn("")  *  satisfying 

5-00  (^i)  /  0  V  i  =  1 . n.  Define 

a®  =  a 
r°  =  r,  and 
a‘  =t  x'a'“L 

Then  we  have 

a  =  a°  =  (t'X  aMVrO  (6-1) 

By  Lemma  6.1,  a^  =  t  x'  aP  is  finite,  and,  in  fact,  a'  =  t  x'  a‘~^  will  be  finite  for  each  i  = 
1,2,...  Thus,  the  Division  Algorithm  applies  in  particular  to  a^; 

=  ( t'  X  a' )  V  (6*2) 

and  substituting  (6-2)  into  (6-1),  we  get 
a  =  ( t'  X  aM  V  r° 


139 


=  { t'  X  (( t'  X  a- )  V  ]  }  V  Tq 

=  (t'xt'xa2)  V  (t'XrM  V  r° 

=  ((t')”X  a-j  V  (t'x  V  r°  (6-3) 

k 

where  (t')*'  denotes  the  k-fold  product  of  t,  X  (t'). 

i-l 

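Apply the Division Algorithm to $a^2$, to get

$$a^2 = (t' \times a^3) \vee r^2$$

and substituting this into (6-3), we get

$$\begin{aligned}
a &= \{ (t')^2 \times [(t' \times a^3) \vee r^2] \} \vee (t' \times r^1) \vee r^0 \\
  &= [(t')^2 \times t' \times a^3] \vee [(t')^2 \times r^2] \vee [t' \times r^1] \vee r^0 \\
  &= [(t')^3 \times a^3] \vee [(t')^2 \times r^2] \vee [t' \times r^1] \vee r^0
\end{aligned}$$

We can continue like this up to any k-th iteration:

$$a = r^0 \vee [t' \times r^1] \vee [(t')^2 \times r^2] \vee \cdots \vee [(t')^k \times r^k] \vee [(t')^{k+1} \times a^{k+1}]$$

or, if we let $(t')^0$ denote the identity matrix e, we have

$$a = \bigvee_{i=0}^{k} [(t')^i \times r^i] \vee [(t')^{k+1} \times a^{k+1}]$$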
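
The k-step expansion can be checked numerically. The sketch below is illustrative only and rests on assumptions that are not fixed by the text: $\times$ is read as the max-plus product, $\times'$ as the min-plus product applied with the elementwise negation of $t$, and $t'$ as the transpose of $t$. The function names `decompose` and `recompose` and the sample matrix are hypothetical.

```python
NEG_INF = float("-inf")

def max_plus(m, v):
    """Max-plus product: result[i] = max over k of m[i][k] + v[k]."""
    return [max(x + y for x, y in zip(row, v)) for row in m]

def min_plus(m, v):
    """Min-plus product: result[i] = min over k of m[i][k] + v[k]."""
    return [min(x + y for x, y in zip(row, v)) for row in m]

def decompose(t, a, k):
    """Return ([r^0, ..., r^k], a^(k+1)) by repeating the division step."""
    negated = [[-x for x in row] for row in t]    # assumption: -(-inf) = +inf
    transposed = [list(col) for col in zip(*t)]   # assumption: t' is the transpose
    residues, current = [], a
    for _ in range(k + 1):
        nxt = min_plus(negated, current)          # a^(i+1)
        opened = max_plus(transposed, nxt)        # t' x a^(i+1)
        residues.append([c if c > o else NEG_INF for c, o in zip(current, opened)])
        current = nxt
    return residues, current

def recompose(t, residues, tail):
    """Evaluate the join of (t')^i x r^i and (t')^(k+1) x a^(k+1), innermost term first."""
    transposed = [list(col) for col in zip(*t)]
    acc = tail
    for r in reversed(residues):
        acc = [max(x, y) for x, y in zip(r, max_plus(transposed, acc))]
    return acc

# Hypothetical data: every row and every column of t has a finite entry.
t = [[0.0, 0.0, NEG_INF],
     [NEG_INF, 0.0, 0.0],
     [0.0, NEG_INF, 0.0]]
a = [4.0, 1.0, 3.0]
residues, tail = decompose(t, a, 2)
rebuilt = recompose(t, residues, tail)
print(residues, tail, rebuilt)
```

The nested evaluation in `recompose` relies on the max-plus product distributing over the pointwise maximum, which is the same property used in passing from (6-2) to (6-3).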

We  now  state  a  result  which  will  be  useful  in  describing  the  division  algorithm  in  the 
image  algebra. 

Lemma 6.3. Let $a, b \in F^n$ (be finite vectors). Then we may express the difference of vectors
a and b, a - b, using the following matrix transform. Define $s \in M_{nn}(-\infty)$ by
$s = \mathrm{diag}((b_1)^*, \ldots, (b_n)^*) = \mathrm{diag}(-b_1, \ldots, -b_n)$ with $-b_i$ denoting the real arithmetic additive
inverse of the real number $b_i$. Then

$$s \times a = c \in F^n, \quad \text{where} \quad c_i = a_i - b_i, \quad i = 1,\ldots,n.$$

Proof:

$$(s \times a)_i = \bigvee_{k=1}^{n} (s_{ik} + a_k) = s_{ii} + a_i = -b_i + a_i$$

for $i = 1,\ldots,n$.


Q.E.D. 
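
Lemma 6.3 is easy to confirm on a small example. The sketch below assumes that $\times$ is the max-plus product and that the off-diagonal entries of the diagonal matrix are $-\infty$; the helper names and the sample vectors are hypothetical.

```python
NEG_INF = float("-inf")

def max_plus(m, v):
    """Max-plus product: result[i] = max over k of m[i][k] + v[k]."""
    return [max(x + y for x, y in zip(row, v)) for row in m]

def difference_matrix(b):
    """Build s = diag(-b_1, ..., -b_n) with -inf off the diagonal."""
    n = len(b)
    return [[-b[i] if i == k else NEG_INF for k in range(n)] for i in range(n)]

a = [7.0, 2.5, -1.0]
b = [3.0, 4.0, -2.0]
c = max_plus(difference_matrix(b), a)
print(c)
```

Because every off-diagonal term contributes $-\infty$ to the maximum, only the diagonal term $-b_i + a_i$ survives in each component.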


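We remark that the vector r as defined in Theorem 6.2 can be obtained in the following
way. Fix $a \in F^n$, finite. Define $f_a : F^n \to F^n$ by

$$f_a(x) = y \quad \text{where} \quad y_i = \begin{cases} 0 & \text{if } a_i > x_i \\ -\infty & \text{otherwise.} \end{cases}$$

Then for $x = t' \times (t \times' a)$,

$$[f_a(x)]_i = \begin{cases} 0 & \text{if } a_i > [t' \times (t \times' a)]_i \\ -\infty & \text{if } a_i \le [t' \times (t \times' a)]_i. \end{cases}$$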


However, it is easily shown that $f_a$ is not an s-lattice homomorphism. For example, choose
n = 2, and a, d, and e as below:


Thenfa(d  Ve)=^ 


but  f  Jd)  V  f  Je)  = 


0 

V 

-00 

0 

0 

0 

0 

5^f»(dVe). 


Thus, according to Theorem 3.1, $f_a$ cannot be represented as a matrix transform. If,
however,  we  go  outside  of  the  structure  of  the  minimax  algebra,  and  use  image  algebra 
operations  in  addition  to  V  and  M  (or  ® ),  we  can  express  this  transform  in  a  succinct  way, 
as  will  be  demonstrated  in  the  next  section. 
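
The failure of the homomorphism property can be reproduced with a few lines of code. This is a sketch under assumptions: $f_a$ is read as the map sending $x$ to the vector with entries $0$ where $a_i > x_i$ and $-\infty$ elsewhere, and the vectors `a`, `d`, `e` below are hypothetical choices made for illustration, not necessarily those of the original example.

```python
NEG_INF = float("-inf")

def f(a, x):
    """Assumed reading of f_a: 0 where a_i > x_i, -inf otherwise."""
    return [0.0 if ai > xi else NEG_INF for ai, xi in zip(a, x)]

def join(u, v):
    """Pointwise maximum of two vectors."""
    return [max(p, q) for p, q in zip(u, v)]

# Hypothetical vectors with n = 2.
a = [1.0, 1.0]
d = [0.0, 2.0]
e = [2.0, 0.0]

image_of_join = f(a, join(d, e))          # f_a(d v e)
join_of_images = join(f(a, d), f(a, e))   # f_a(d) v f_a(e)
print(image_of_join, join_of_images)
```

With this reading $f_a$ is order-reversing, so enlarging the argument can only turn entries from $0$ to $-\infty$; that is why the image of the join falls below the join of the images.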




A final division algorithm. The duality of the operations of the matrix algebra enables us
to describe a dual division algorithm. We omit the proofs, as they are the duals of the proofs
given in the previous section.

Lemma  6.4.  Let  a  E  F"  6c  finite,  and  t  €  Ainn(+“)  '?+<»(^i)  5^  0  V  i  =  l,...,n.  Define 
t  by  t  =  (t*  y.  Then  both  t'  X  a  and  t  x'  (t'  X  a)  are  finite,  and 

tx'(t'X  a)  >a 

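Lemma 6.5. (The Dual Division Algorithm) Let a, t satisfy the hypothesis of Lemma
6.4. Then for $q = t' \times a$, and r defined by

$$r_i = \begin{cases} a_i & \text{if } a_i < [t \times' (t' \times a)]_i \\ +\infty & \text{if } a_i = [t \times' (t' \times a)]_i \end{cases}$$

we have

$$a = (t \times' q) \wedge r.$$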


6.2. An Image Algebra Division Algorithm
Using  the  isomorphism  we  can  express  these  ideas  in  the  image  algebra.  Let 

t=(t7fort€(FX)-^ 

Lemma  6.6.  Let  a  G  F^,  t  G  (F^)^  be  such  that  S_oo(tx)  5^  0  V  x  G  X.  Then  each  of 
a  IZ3  t  and  ( a  1Z3  t )  0  t'  are  finite,  and  a  >  ( a  □  t )  0  t'. 

This next lemma is the counterpart to Lemma 6.3.

Lemma 6.7. Let $a, b \in F^X$. Then the image c = a - b may be expressed using a template
in the following way. Define $s \in (F^X_{-\infty})^X$ by

$$s_y(x) = \begin{cases} -b(y) & \text{if } x = y \\ -\infty & \text{otherwise.} \end{cases}$$

Then  a  Q  s  =  a  —  b. 

Using  the  lattice  characteristic  function,  it  is  sometimes  the  case  that  we  can  stay 
within  the  lattice  operations  of  V  and  0  and  the  image  algebra  operation  of  4-  when  needing 
to  express  a  characteristic  function.  An  example  of  this  follows  immediately. 


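Theorem 6.8. The Division Algorithm. Let a, t satisfy the hypothesis of Lemma 6.6.
Then for $q = a \mathbin{\boxed{\wedge}} t$ and r defined by

$$r = a + \chi_{>0}\left[ a - \left( (a \mathbin{\boxed{\wedge}} t) \mathbin{\boxed{\vee}} t' \right) \right]$$

we have that

$$a = (q \mathbin{\boxed{\vee}} t') \vee r.$$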

Proof:  We  need  to  show  that  'I'”^(r)  matches  with  our  definition  of  the  matrix  r  in 

Theorem  6.2.  Let  b  =  (a  0  t)  Q  t'.  Then  using  Lemma  6.7,  a  -  b  =  a  0  s, 
where 

(-My)  ifx=y 
®yW  I  _oo  otherwise 

Thus,  a  —  b  >  0  implies  that  a  0  s  >  0.  Now, 


Q.E.D. 




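Iterating k times on an image a and a template t satisfying the hypothesis of Lemma
6.6, we obtain

$$a = \bigvee_{i=0}^{k} \left( r^i \mathbin{\boxed{\vee}} (t')^i \right) \vee \left( a^{k+1} \mathbin{\boxed{\vee}} (t')^{k+1} \right)$$

where any template t raised to the zero-th power, $t^0$, is the identity template, e.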

In  the  boolean  case,  there  exists  an  integer  m  such  that 

a„,I2|  {tr=0 

so  that  the  expression  for  a  becomes 

a  =  ^  ri^  0 
k-o 

One useful application of this result is in data compression. By encoding the $r^i$'s in run
length code, the image can be represented by fewer bits of data, and reconstructed exactly
once t is known.
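
To make the compression remark concrete, the following sketch run-length encodes one row of a boolean remainder image and decodes it again. It is a generic run-length coder written for illustration; the function names and the sample row are hypothetical and no particular code format is implied by the text.

```python
def rle_encode(bits):
    """Collapse a sequence into (value, run_length) pairs."""
    runs = []
    for bit in bits:
        if runs and runs[-1][0] == bit:
            runs[-1][1] += 1          # extend the current run
        else:
            runs.append([bit, 1])     # start a new run
    return [(value, count) for value, count in runs]

def rle_decode(runs):
    """Expand (value, run_length) pairs back into the original sequence."""
    return [value for value, count in runs for _ in range(count)]

# Hypothetical row of a sparse boolean remainder image.
row = [0, 0, 0, 0, 1, 1, 0, 0, 0, 0, 0, 0]
encoded = rle_encode(row)
decoded = rle_decode(encoded)
print(encoded)
```

Remainders that are mostly empty produce a few long runs, which is where the saving comes from; the decoding step is exact.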

We  have  the  dual  Division  Algorithm  stated  in  the  image  algebra  also. 

Propoaitioa  6.9.  Let  a  6  R^,  t  €  (R^oo)^  that  S_oo{K)  5^  0  V  x  G  X.  For  i  by 

t  =  (t*y,  we  have  that  each  o/a  0  t'  and  {a,  0  t')  □  t  are  finite,  and  a  <  (a  0  t')  E3  t, 

Proposition  6.10.  (The  Dual  Division  Algorithm).  Let  a,  t  satisfy  the  hypothesis  of 
Proposition  6.9.  Then  for  q  =  a  0  t'  and  r  defined  by 

r  =  a  Ax<o(a-((a  0  t')  E3  t)] 

we  have  that 


a  =  (  q  0  t )  A  r. 




CHAPTER 7
TWO EXAMPLES

7.1. An Operations Research Problem Stated in Image Algebra Notation

This section gives a short description of the transportation problem in linear programming
[63], and provides a translation of the dual transportation problem into image algebra
notation. Thus, this provides an example of the use of the isomorphism $\Psi^{-1}$.

Let m producers and n consumers of some commodity be given. Let $p_i$ denote the production
capacity of producer i, $d_j$ denote the demand of consumer j, and $c_{ij}$ denote the cost of
transporting one unit of commodity from producer i to consumer j. The problem is to determine
how much commodity to ship from each producer to each consumer so that consumer
demands are met, production capacities are not exceeded, and transportation costs are
minimized. This can be formulated as a linear programming (LP) problem, which we state
as follows.

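Let $z_{ij}$ be the number of units of commodity to be shipped from producer i to consumer
j. Then the total transportation cost

$$\sum_{i=1}^{m} \sum_{j=1}^{n} c_{ij} z_{ij}$$

is to be minimized. To stay within production capacity, we also must have

$$\sum_{j=1}^{n} z_{ij} \le p_i, \quad i = 1,\ldots,m$$

and to satisfy consumer demands we must have

$$\sum_{i=1}^{m} z_{ij} \ge d_j, \quad j = 1,\ldots,n.$$

Thus the LP problem is to

$$\text{Minimize} \quad \sum_{i=1}^{m} \sum_{j=1}^{n} c_{ij} z_{ij}$$

$$\text{Subject to} \quad -\sum_{j=1}^{n} z_{ij} \ge -p_i, \quad i = 1,\ldots,m \tag{7-1}$$

$$\sum_{i=1}^{m} z_{ij} \ge d_j, \quad j = 1,\ldots,n, \text{ and} \tag{7-2}$$

$$z_{ij} \ge 0 \quad \text{for all } i,j.$$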

Let $x_i$ be the dual variable associated with the i-th constraint in (7-1), and $y_j$ the dual
variable associated with the j-th constraint in (7-2). Then the dual transportation problem is
|M| 


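$$\begin{aligned}
\text{Maximize} \quad & -\sum_{i=1}^{m} p_i x_i + \sum_{j=1}^{n} d_j y_j \\
\text{Subject to} \quad & -x_i + y_j \le c_{ij} \quad \text{for all } i,j \\
& x_i \ge 0,\ y_j \ge 0 \quad \text{for all } i,j.
\end{aligned}$$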
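
The relation between the transportation problem and its dual can be illustrated on a tiny instance. The sketch below does not solve either problem; it only checks feasibility and compares objective values for hand-picked candidates. All data and function names are hypothetical.

```python
def primal_cost(c, z):
    """Total transportation cost: sum of c[i][j] * z[i][j]."""
    return sum(c[i][j] * z[i][j] for i in range(len(c)) for j in range(len(c[0])))

def primal_feasible(z, p, d):
    """Row sums within capacity, column sums meet demand, shipments non-negative."""
    rows_ok = all(sum(z[i]) <= p[i] for i in range(len(p)))
    cols_ok = all(sum(z[i][j] for i in range(len(p))) >= d[j] for j in range(len(d)))
    nonneg = all(v >= 0 for row in z for v in row)
    return rows_ok and cols_ok and nonneg

def dual_value(p, d, x, y):
    """Dual objective: -sum(p_i x_i) + sum(d_j y_j)."""
    return -sum(pi * xi for pi, xi in zip(p, x)) + sum(dj * yj for dj, yj in zip(d, y))

def dual_feasible(c, x, y):
    """Check -x_i + y_j <= c_ij and non-negativity of the dual variables."""
    signs_ok = all(v >= 0 for v in x + y)
    bounds_ok = all(-x[i] + y[j] <= c[i][j] for i in range(len(x)) for j in range(len(y)))
    return signs_ok and bounds_ok

# Hypothetical instance with m = 2 producers and n = 2 consumers.
c = [[1, 2], [3, 1]]
p = [3, 4]
d = [2, 3]
z = [[2, 0], [0, 3]]   # candidate shipping plan
x = [0, 0]             # candidate dual variables for (7-1)
y = [1, 1]             # candidate dual variables for (7-2)

cost = primal_cost(c, z)
bound = dual_value(p, d, x, y)
print(primal_feasible(z, p, d), dual_feasible(c, x, y), cost, bound)
```

For any feasible pair the dual value cannot exceed the primal cost, so when the two values coincide, as they do for this instance, both candidates are optimal.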

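This is equivalent to solving

$$\begin{aligned}
\text{Minimize} \quad & -\left( -\sum_{i=1}^{m} p_i x_i + \sum_{j=1}^{n} d_j y_j \right) \\
\text{Subject to} \quad & -x_i + y_j \le c_{ij} \quad \text{for all } i,j \\
& x_i \ge 0,\ y_j \ge 0 \quad \text{for all } i,j,
\end{aligned}$$

which is

$$\begin{aligned}
\text{Minimize} \quad & \sum_{i=1}^{m} p_i x_i - \sum_{j=1}^{n} d_j y_j \\
\text{Subject to} \quad & -x_i + y_j \le c_{ij} \quad \text{for all } i,j \\
& x_i \ge 0,\ y_j \ge 0 \quad \text{for all } i,j.
\end{aligned}$$

Make a change of variables by letting

$$v_j = -y_j \quad \text{and} \quad u_i = -x_i, \quad \text{for all } i,j.$$

Then we have the equivalent dual LP problem

$$\begin{aligned}
\text{Minimize} \quad & \sum_{j=1}^{n} d_j v_j - \sum_{i=1}^{m} p_i u_i \\
\text{Subject to} \quad & u_i - v_j \le c_{ij} \quad \text{for all } i,j \\
& u_i \le 0,\ v_j \le 0 \quad \text{for all } i,j.
\end{aligned}$$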

Using the theory of complementary slacks [64], if we assume that the production capacities
satisfy $p_i > 0$ for all i, then we can be guaranteed that for each $i = 1,\ldots,m$, there exists at
least one $j \in \{1,\ldots,n\}$ such that

$$u_i - v_j = c_{ij},$$

and, hence,

$$u_i = \min_{j=1}^{n} \{ c_{ij} + v_j \}, \tag{7-3}$$

where $u = (u_1,\ldots,u_m)$ and $v = (v_1,\ldots,v_n)$ are optimal feasible solutions. We can rewrite (7-3)
in vector notation, as

$$u = C \times' v,$$

and $u_i, v_j \le 0$.
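
Equation (7-3) is a min-plus matrix-vector product, which the following sketch evaluates directly. It assumes that $\times'$ denotes the min-plus product; the cost matrix and the vector `v` are hypothetical, and `v` is simply a feasible choice rather than the output of an LP solver.

```python
def min_plus(c, v):
    """Min-plus product: result[i] = min over j of c[i][j] + v[j]."""
    return [min(cij + vj for cij, vj in zip(row, v)) for row in c]

# Hypothetical data: 2 producers, 2 consumers.
c = [[1.0, 2.0],
     [3.0, 1.0]]
v = [-1.0, -1.0]

u = min_plus(c, v)

# Every constraint u_i - v_j <= c_ij holds, and each row has a tight constraint.
constraints_hold = all(u[i] - v[j] <= c[i][j] for i in range(2) for j in range(2))
each_row_tight = all(any(u[i] - v[j] == c[i][j] for j in range(2)) for i in range(2))
print(u, constraints_hold, each_row_tight)
```

Defining `u` as the row-wise minimum makes each constraint hold automatically and forces at least one constraint per row to hold with equality, which is exactly the situation described by complementary slackness.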

To formulate this problem in the context of the image algebra, we define X and Y to be
non-empty,  finite  coordinate  sets,  i  X|  =  m,  1  y1  =  n.  Define  d  €  (R^)''^  by 




and  define  p 


i=j 

otherwise  ’ 


i  =j 

otherwise  ' 


Define  a  G  R^,  b  G  by 


a(X|)  Uj  (the  variable) 
b(Xi)  =  Vi  (the  variable) . 


Then  we  have: 


LP 

Image  Algebra 

j-i  >  > 

S(b  M  d) 

m 

S(a  M  p) 

i»i 

Now,  define  t  €{Rjj-^by 


We  have  the  equation  u  =  C  x'  v  translates  as  a  =  b  0  t.  Thus,  in  image  algebra  nota¬ 


tion,  the  dual  LP  problem  is 


Minimize 


S(b  El  d)  —  Ij(aE  p) 


Subject  to 


a  =  b  123  t 
a  <  0,  b  <  0 




7.2. An Image Complexity Measure

This  section  presents  an  image  complexity  measure,  a  term  used  in  image  processing  to 
describe  any  method  which  provides  a  quantitative  measure  of  some  feature  or  set  of  features 
in  an  image.  Image  complexity  measures  are  used  either  as  a  pre-processing  step  in  which 
the measures help direct the selection of the next processing step, or in conjunction with other
information  derived  from  the  image  to  identify  objects  of  interest. 

The  measure  investigated  [65]  is  based  on  a  method  discussed  by  Mandelbrot  [66]  for 
curve  length  measurements.  The  original  algorithm  was  modified  and  translated  into  image 
algebra.  The  measure  itself  consists  of  a  graph  which  in  theory  gives  an  indication  of  the 
rate  of  change  of  variation  in  the  gray  level  surface.  The  algorithm  for  computing  the  meas¬ 
ure  is  presented,  followed  by  a  discussion  of  an  application  to  12  outdoor  images. 

The  general  approach  of  the  algorithm  is  to  make  successive  approximations  of  the  area 
of  a  gray  level  surface,  and  then  plot  the  approximations  using  a  log-log  scale.  The  log-log 
scale  is  purported  to  allow  a  better  visual  inspection  of  the  information  contained  in  the 
graph. 

Consider  all  points  with  distance  to  the  gray  level  surface  of  no  more  than  k.  These 
points  form  a  blanket  of  thickness  2k,  and  the  suggested  surface  area  A(k)  of  the  gray  level 
surface  is  the  volume  of  the  blanket  divided  by  2k.  Here  we  have  A(k)  increasing  as  k 
decreases. 
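
The blanket construction can be sketched in a few lines. The code below is an illustration under assumptions, not the algorithm as implemented for this study: the neighbourhood used to grow the surfaces (centre weight 1, four nearest neighbours weight 0, neighbours outside the image ignored) is an assumed choice, and the volume is taken as the plain sum of the differences between the upper and lower surfaces rather than a triangular-column estimate with edge correction. All names are hypothetical.

```python
def blanket_step(img, sign):
    """Grow the upper surface (sign=+1) or lower the lower surface (sign=-1) once."""
    rows, cols = len(img), len(img[0])
    pick = max if sign > 0 else min
    out = []
    for i in range(rows):
        row = []
        for j in range(cols):
            candidates = [img[i][j] + sign]              # centre pixel, weight +1 or -1
            for di, dj in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                ni, nj = i + di, j + dj
                if 0 <= ni < rows and 0 <= nj < cols:
                    candidates.append(img[ni][nj])       # neighbour, weight 0
            row.append(pick(candidates))
        out.append(row)
    return out

def blanket_areas(img, kmax):
    """Return [A(1), ..., A(kmax)] with A(k) = blanket volume / (2k)."""
    upper, lower = img, img
    areas = []
    for k in range(1, kmax + 1):
        upper = blanket_step(upper, +1)
        lower = blanket_step(lower, -1)
        volume = sum(u - b for ur, br in zip(upper, lower) for u, b in zip(ur, br))
        areas.append(volume / (2 * k))
    return areas

# Hypothetical test image: a flat 3 x 3 surface.
flat = [[5] * 3 for _ in range(3)]
print(blanket_areas(flat, 3))
```

On a perfectly flat image the blanket has thickness exactly 2k everywhere, so the estimated area equals the number of pixels for every k; rougher surfaces give larger values at small k.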

To begin the computation of the surface area for k = 1, 2, ..., an upper surface $u_k$ and a
lower surface $b_k$ are defined iteratively in the following manner. Let a be the input image.
Let

$$u_0 = b_0 = a$$

Then define $u_k$ and $b_k$ for k = 1, 2, ..., by




$$u_k = u_{k-1} \mathbin{\boxed{\vee}} t$$

$$b_k = b_{k-1} \mathbin{\boxed{\wedge}} (-t)$$

where 

t  = 

The volume v(k) of the "blanket" between the upper and lower surfaces is calculated for
each k by computing

$$p_1(k) = u_k \oplus s, \qquad q_1(k) = b_k \oplus (-s)$$

where

s =

Let $v_1(k) = \sum (p_1(k) + q_1(k))$.


This method of estimating the volume was derived using elementary calculus. We
explain the method for calculating the volume between the upper surface and the coordinate
set X. The volume between the lower surface and X is found in a similar way. Given four
pixel locations in X, (i,j), (i,j+1), (i+1,j), and (i+1,j+1), a box was constructed from the eight
points in $\mathbb{R}^3$ corresponding to the four gray values $u_k(i,j)$, $u_k(i,j+1)$, $u_k(i+1,j)$, $u_k(i+1,j+1)$,
and the four given pixels. Drawing a line from $u_k(i,j)$ to $u_k(i+1,j+1)$ and a line from (i,j) to
(i+1,j+1), the volume of the triangular column determined by the six points
$u_k(i,j)$, $u_k(i+1,j)$, $u_k(i+1,j+1)$, (i,j), (i+1,j), and (i+1,j+1) was found using elementary
methods from calculus. Similarly, the volume of the triangular column determined by the six
points $u_k(i,j)$, $u_k(i,j+1)$, $u_k(i+1,j+1)$, (i,j), (i,j+1), and (i+1,j+1) was determined. The volumes
of the two pieces are added together to give an estimate of the volume of the box determined
by the eight initial points. This is done over the entire coordinate set X, and all volumes
are added together to give an estimate of the volume between X and the gray value surface $u_k$.
The method was expressed using the image algebra operation $\oplus$ and an invariant template,
omitting the boundary effects. Using this approach, the volume is overestimated, so it is
corrected by applying a variant template w effective only on the edge pixels. Define w by

if  X  is  a  top  edge  pixel  and  not  the  top  right  corner  pixel, 

Wx  = 

if  X  is  the  top  right  corner  pixel, 

if  X  is  a  right  edge  pixel  but  not  the  top  right  corner  pixel,  and  =0,  if  x  is  otherwise. 

To correct for the extra volume added in on the edge pixels, we calculate

$$p_2(k) = u_k \oplus (-w), \qquad q_2(k) = b_k \oplus w$$

and let $\mathrm{volerr}(k) = \sum (p_2(k) + q_2(k))$. The correct volume v(k) is

$$v(k) = v_1(k) + \mathrm{volerr}(k).$$

The approximated surface area is

$$\mathrm{area}(k) = \frac{v(k)}{2k}.$$

The rate of change of log(area(k)) with respect to log(k) contains important information
about the image. The slope S(k) of area(k) versus k is computed on a log-log scale for each k
by finding the best fitting straight line through the three points

$$(\log(k-1), \log(\mathrm{area}(k-1))), \quad (\log(k), \log(\mathrm{area}(k))), \quad (\log(k+1), \log(\mathrm{area}(k+1))).$$

The graph of S(k) versus k is called the signature of the image. We can also calculate a
signature for the case where the array X represents the bottom surface and $u_k$ the upper
surface. We call this the upper signature. Similarly, the signature which is calculated using
$\{b_k\}$ for the lower surfaces and X for the upper surfaces is called the lower signature.
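
The slope computation can be sketched as an ordinary least-squares fit through three log-log points. This is a minimal illustration with hypothetical names and synthetic data; it starts at k = 2 because log(k-1) is undefined for k = 1, a detail the text does not address.

```python
import math

def fit_slope(points):
    """Least-squares slope of the best fitting straight line through (x, y) points."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    return sxy / sxx

def signature(area, k):
    """S(k): slope through the log-log points at k-1, k, k+1 (requires k >= 2)."""
    points = [(math.log(j), math.log(area[j])) for j in (k - 1, k, k + 1)]
    return fit_slope(points)

# Synthetic areas following a power law area(k) = 100 * k**(-0.5).
area = {k: 100.0 * k ** -0.5 for k in range(1, 7)}
slopes = {k: signature(area, k) for k in range(2, 6)}
print(slopes)
```

For an exact power law the three points are collinear on the log-log scale, so the fitted slope equals the exponent at every k; departures from a constant slope are what the signature is meant to expose.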

This algorithm was run on 12 images. For each image, we calculated the input image,
$u_i$, $b_i$, i = 1, ..., 50, and the graph of the upper and lower signatures.

As k increases, regions of pixels initially having the highest gray values decrease in size
in the images $b_k$. However, as k increases, the images $u_k$ shrink regions having lower gray
values. In theory, this asymmetry can be taken advantage of. Roughly, the lower signature
represents  the  shape  of  objects  with  high  gray  values,  and  the  upper  signature  represents  the 
distribution  of  objects  throughout  the  image.  The  images  to  which  we  applied  this  method 
were  infrared,  so  we  were  mainly  interested  in  the  lower  signatures. 

The magnitude of the curve S(k) is related to the information lost on objects with
details less than k in size. The more gray level variation at distance k, the higher the values
for S(k). Thus, if at small k S(k) is large, then there are "high-frequency" gray level variations,
and if at large k S(k) is large, then we have "low-frequency" gray level variations. The curve
S(k) thus gives us information about the rate of change of variations in the gray level surface.

After running the program on a dozen images, we have concluded that this algorithm is
too sensitive to the great variance in outdoor scenery. For example, an image which has a
background of trees and no targets, and an image which has two distinct targets and no trees
as background have similar graphs for the lower signatures. While the lower signature
represents more of the shape of the hot objects (areas with high gray values) in the image, in
one image we have no hot objects while in the other, there are two distinct hot objects. As
another example, in two other images we have a target with a road and a field as background,
yet the graphs for the upper signatures for these images have a very distinct
difference. The theory suggests that upper signatures should represent similar targets, but
we cannot draw that conclusion from this data. The examples given in the original paper
[64] are of a very regular texture and are presented in a controlled environment. It is very
likely that the controlled environment in which the data was taken is one reason why this
algorithm was successful for those authors.

The  initial  goal  of  investigating  this  type  of  complexity  measure  was  that  these  graphs 
would  hopefully  give  a  measure  of  gray  level  variation  within  an  image  and  help  in  choosing 
a  more  effective  edge  operator.  If  an  image  has  a  high  incidence  of  gray  level  variation  at 
small  values  of  k,  then  it  is  reasonable  to  assume  a  more  sensitive  mask,  such  as  the  gradient 
mask,  would  give  better  results.  Otherwise,  if  an  image  had  small  values  of  S(k)  at  small 
values  of  k,  then  computation  time  could  be  saved  by  using  a  Sobel  operator  instead  of  a 
computationally  intensive  edge  operator  such  as  the  Kirsch.  Unfortunately,  the  algorithm 
did  not  produce  data  that  leads  to  this  conclusion. 


CONCLUSIONS  AND  SUGGESTIONS  FOR  FURTHER  RESEARCH 


We  have  shown  that  the  establishment  of  an  isomorphism  between  a  lattice-based 
matrix  theory  and  the  image  algebra  provides  a  powerful  tool  in  image  processing  which  was 
not  available  previous  to  this  research.  Just  as  linear  transforms  are  able  to  be  represented 
as  matrices  within  the  structure  of  linear  algebra,  lattice  transforms  are  able  to  be 
represented  as  matrices  within  the  structure  of  the  minimax  algebra,  and  thus  all  mathemati¬ 
cal  results  of  the  minimax  algebra  are  applicable  to  solving  problems  in  image  processing. 

In  particular,  we  have  shown  that 

(1)  Many  notions  encountered  in  investigating  linear  transforms,  such  as  the 
eigenvalue/eigenvector  problem,  rank,  linear  independence,  and  solutions  to  systems  of 
equations,  have  their  counterpart  in  image  algebra.  The  use  of  these  notions  to  solving 
specific  image  processing  problems  remains  to  be  seen. 

(2)  The image algebra is useful as a model in mapping a class of non-linear transforms to
parallel architectures. It is feasible to map any arbitrary lattice transform to most
parallel architectures; that is, given a network of processors that are interconnected by
communication links, a lattice transform has a weak decomposition into a product of
lattice transforms that are each local with respect to the network if and only if every
pair of processors has a two-way path of communication between them. There is at
most one weak (pointwise) operation of maximum, with the remaining operations being
convolutions. The pointwise maximum, which represents the fact that a small amount
of storage is needed to compute the transform, is not an unreasonable restriction.




(3)  Mathematical  morphology  is  a  special  subclass  of  the  lattice  transforms,  namely  the 
transforms  corresponding  to  invariant  templates  from  X  to  X  having  the  target  pixel 
as  part  of  their  support  at  each  pixel.  Thus,  the  lattice  subalgebra  of  the  image  alge¬ 
bra  generalizes  mathematical  morphology.  The  image  algebra  is  a  clear  and  simple 
mathematical  representation  of  morphological  concepts,  avoiding  the  cumbersome 
notion  of  umbra,  for  example. 

(4)  The minimax algebra lends itself well to the investigation of matrix decomposition
techniques. A weak decomposition was shown for any matrix lattice transform. Thus,
the application of a matrix product for any purpose can be implemented in parallel.
This includes problems from operations research.

(5)  Although  the  minimax  algebra  is  not  a  Euclidean  domain,  the  representation  of  a  type 
of  division  algorithm  was  found.  In  boolean  images,  there  are  clear  applications  of 
these  types  of  skeletonizing  techniques  to  image  compression. 

We  now  list  some  suggestions  for  further  research.  The  author  continues  to  pursue  this 
general  area  of  research,  and  would  appreciate  being  informed  of  any  results  obtained  related 
to  the  following  problems. 

Unlike  linear  algebra,  the  minimax  algebra  is  relatively  unknown.  The  matrix  proper¬ 
ties  developed  so  far  in  the  literature  [39]  were  meant  primarily  for  use  in  operations  research 
problems.  A  wealth  of  mathematical  results,  with  possible  applications  to  real  world  prob¬ 
lems,  are  obviously  obtainable  from  investigating  other  uses  of  the  minimax  algebra.  The 
matrix  decomposition  techniques  presented  in  this  dissertation  are  a  clear  example.  Specific 
problems are: investigate Chapter 3's statements for applications to solving image processing
problems.  What  specific  applications  does  the  eigenproblem  have  in  image  processing?  The 
statements  in  Chapter  3  are  only  a  few  of  the  minimax  algebra  properties  that  were  mapped 




to  image  algebra.  Investigate  other  properties  in  the  Minimax  Algebra  book  which  may  have 
uses  in  image  processing.  Is  there  a  minimax  transform  equivalent  to  the  linear  Fourier 
transform?  What  are  its  uses?  What  other  types  of  linear  transforms  have  a  minimax 
equivalent?  Can  other  techniques  in  linear  algebra,  such  as  the  SVD,  Kronecker  products, 
etc.  be  described  in  a  similar  way  in  the  minimax  algebra?  Develop  techniques  for  decompo¬ 
sition  of  square  matrices  that  are  block  toeplitz  with  toeplitz  blocks,  with  non-null  diagonal 
entries.  As  mentioned  in  Chapter  4,  this  type  of  matrix  corresponds  to  the  concept  of  a 
structuring  element  in  mathematical  morphology,  and  to  a  class  of  invariant  templates  in  the 
image  algebra. 

Another question concerns the existence theorem, Theorem 5.24. Does there exist a
local  decomposition  that  is  not  weak,  that  is,  a  decomposition  with  no  operation  of  V?  This 
would  relieve  the  storage  requirement  associated  with  V.  Also,  is  there  a  similar  existence 
theorem  with  the  digraph  D(W)  replacing  the  graph  G(W)?  Although  the  existence  theorem 
given  in  Chapter  5  is  constructive,  one  would  rarely  implement  the  decomposition  in  this 
manner.  Describe  a  general  decomposition  technique  giving  local  templates  and  which  is 
minimal  with  respect  to  some  criterion,  such  as  the  number  of  factors  in  the  decomposition. 
Also,  what  type  of  applications  to  operations  research  does  the  decomposition  theorem  have? 

The  continuous  cases  for  the  image  algebra  operands  ©,  M  ,  and  ®  have  yet  to  be  well 
established.  Establish  the  mathematical  foundations  for  the  continuous  counterparts  involv¬ 
ing  the  lattice  operations  M  ,  ® ,  and  V. 

In his dissertation Gader asks the question about relating circulant templates under El
or  ®  to  group  algebras.  Will  the  formal  minimax  matrix  relationship  established  with  the 
image  algebra  make  the  pursuit  of  this  particular  question  any  easier? 




Is there a practical use for the division algorithm? Generalize the division algorithm to
a sequence of templates $t_i$ replacing the single t.




REFERENCES 


1.  J.  von  Neumann,  “The  General  Logical  Theory  of  Automata,”  in  Cerebral  Mechanism 
in  Behavior:  The  Hixon  Symposium,  Wiley  and  Sons,  New  York  (1951). 

2.  S.H.  Unger,  “A  Computer  Oriented  Toward  Spatial  Problems,”  Proc.  IRE  46  (1958), 
1744-1750. 

3.  K.E.  Batcher,  “Design  of  a  Massively  Parallel  Processor,”  IEEE  Trans.  Computers 
29(9)  (1980),  836-840. 

4.  M.J.B.  Duff,  “CLIP4,”  in  Special  Computer  Architectures  for  Pattern  Processing,  ed. 
K.S.  Fu,  CRC  Press,  Boca  Raton,  FL  (1982). 

5.  T.J. Fountain, K.N. Matthews, and M.J.B. Duff, “The CLIP7A Image Processor,”
IEEE Pattern Analysis and Machine Intelligence 10(3) (May 1988).

6.  L. Uhr, “Pyramid Multi-Computer Structures, and Augmented Pyramids,” in Computing
Structures for Image Processing, ed. M.J.B. Duff, Academic Press, London (1983).

7.  W.D.  Hillis,  The  Connection  Machine,  The  MIT  Press,  Cambridge,  MA  (1985). 

8.  J.C.  Klein  and  J.  Serra,  “The  Texture  Analyzer,”  J.  Micros  95  (1972),  349-356. 

9.  S.R.  Sternberg,  “Biomedical  Image  Processing,”  Computer  16(1)  (January  1983),  22-34. 

10.  D.L.  McCubbrey  and  R.M.  Lougheed,  “Morphological  Image  Analysis  Using  a  Raster 
Pipeline  Processor,”  pp.  444-452  in  IEEE  Comp.  Soc.  Workshop  on  Comp.  Arch,  for 
Patt.  Analysis  and  Image  Database  Management,  Miami  Beach,  FL  (1985). 

11.  E.  Cloud  and  W.  Holsztynski,  “Higher  Efficiency  for  Parallel  Processors,”  pp.  416-422 
in  Proceedings  IEEE  Southcon  Orlando,  FL  (March,  1984). 

12.  H.  Minkowski,  Gesammelte  Abhandlungen,  Teubner  Verlag,  Leipzig-Berlin  (1911). 

13.  H. Minkowski, “Volumen und Oberfläche,” Mathematische Annalen 57 (1903), 447-495.

14.  Hadwiger, Vorlesungen über Inhalt, Oberfläche und Isoperimetrie, Springer-Verlag,
Berlin (1957).

15.  G. Matheron, Random Sets and Integral Geometry, Wiley, New York (1975).

16.  J. Serra, Image Analysis and Mathematical Morphology, Academic Press, London
(1982).

17.  T.R. Crimmins and W.M. Brown, “Image Algebra and Automatic Shape Recognition,”
IEEE Trans. Aerospace and Elec. Systems AES-21(1) (Jan. 1985), 60-69.

18.  R.M. Haralick, L. Shapiro, and J. Lee, “Morphological Edge Detection,” IEEE Journal
of Robotics and Automation RA-3(1) (April 1987), 142-157.

19.  R.M. Haralick, S.R. Sternberg, and X. Zhuang, “Image Analysis Using Mathematical
Morphology: Part 1,” IEEE Trans. on Patt. Analysis and Mach. Intelligence PAMI-9(4)
(July 1987), 532-550.




20.  P.  Maragos  and  R.W.  Schafer,  “Morphological  Skeleton  Representation  and  Coding  of 
Binary  Images,”  IEEE  Trans.  Acoustics,  Speech,  and  Signal  Proc.  ASSP-34(5)  (Oct. 
1986),  1228-1244. 

21.  P.  Maragos  and  R.W.  Schafer,  “Morphological  Filters  Part  I:  Their  Set-Theoretic 
Analysis  and  Relations  to  Linear  Shift-Invariant  Filters,”  IEEE  Trans.  Acoustics, 
Speech,  and  Signal  Proc.  ASSP-35  (Aug.  1987),  1153-1169. 

22.  P.  Maragos  and  R.W.  Schafer  ,  “Morphological  Filters  Part  II:  Their  Relations  to 
Median,  Order-Statistic,  and  Stack  Filters,”  IEEE  Trans.  Acoustics,  Speech,  and  Sig¬ 
nal  Proc.  ASSP-35  (Aug.  1987),  1170-1184. 

23.  S.R.  Sternberg,  “Language  and  Architecture  for  Parallel  Image  Processing,”  in  Proc.  of 
the  Conf  on  Pattern  Rec.  in  Practice,  Amsterdam  (May  1980). 

24.  S.R.  Sternberg,  “Overview  of  Image  Algebra  and  Related  Issues,”  in  Integrated  Tech¬ 
nology  for  Parallel  Image  Processing,  ed.  S.  Levialdi,  Academic  Press,  London  (1985  ). 

25.  P.  Maragos,  “A  Unified  Theory  of  Translation-Invariant  Systems  With  Applications  to 
Morphological  Analysis  and  Coding  of  Images,”  Ph.D.  thesis,  Georgia  Inst.  Tech., 
Atlanta  (1985). 

26.  P.E.  Miller,  “Development  of  a  Mathematical  Structure  for  Image  Processing,”  Optical 
Division  Tech.  Report,  Perkin-Elmer  (1983). 

27.  R.  Hockney  and  C.  Jesshope,  Parallel  Computers  :  Architecture,  Programming,  and 
Algorithms,  Adam  Hilger,  Bristol  (1981). 

28.  J.T.  Schwartz,  “A  Taxonomic  Table  of  Parallel  Computers,  Based  on  55  Designs,” 
Technical  Report,  Courant  Institute  of  Mathematical  Sciences,  New  York  University 
(1983). 

29.  A.V. Oppenheim and R.W. Schafer, Digital Signal Processing, Prentice-Hall Inc.,
Englewood Cliffs, NJ (1975).

30.  S.R.  Sternberg,  “Languages  and  Architectures  for  Parallel  Image  Processing,”  in  Pat¬ 
tern  Recognition  in  Practice,  North-Holland  Publishing  Co.,  New  York,  NY  (1980). 

31.  O. Ersoy, “Semisystolic Array Implementation of Circular, Skew Circular, and Linear
Convolutions,” IEEE Trans. on Computers C-34(2) (February 1985), 190-196.

32.  G.X.  Ritter,  J.N.  Wilson,  J.L.  Davidson,  “Image  Algebra:  An  Overview,”  to  appear  in 
Computer  Vision,  Graphics,  and  Image  Processing,  (1989). 

33.  P.D.  Gader,  “Image  Algebra  Techniques  for  Parallel  Computation  of  Discrete  Fourier 
Transforms  and  General  Linear  Transforms,”  Ph.D.  Dissertation,  University  of 
Florida,  Gainesville,  FL  (1986). 

34.  G.X. Ritter, J.N. Wilson, and J.L. Davidson, “Standard Image Processing Algebra
Document Phase II,” TR (7) Image Algebra Project, F08635-84-C-0295, Eglin AFB,
FL (1987).

35.  J. Serra, Image Analysis and Mathematical Morphology, Volume 2: Theoretical
Advances, Academic Press, New York (1988).

36.  Birkhoff, Lattice Theory, American Mathematical Society, Providence, RI (1984).

37.  R. Cuninghame-Green, “Process Synchronisation in Steelworks - a Problem of Feasibility,”
pp. 323-328 in Proc. 2nd Int. Conf. on Oper. Research, ed. Banbury, English
University Press, London (1960).




38.  R.  Cuninghame-Green,  “Describing  Industrial  Processes  with  Interference  and  Approxi¬ 
mating  their  Steady-State  Behaviour,”  Oper.  Research  Quart.  13  (1962),  95-100  . 

39.  R.  Cuninghame-Green,  Minimax  Algebra:  Lecture  Notes  in  Economics  and  Mathemati¬ 
cal  Systems  166,  Springer-Verlag,  New  York  (1979). 

40.  J.E. Cohen, “Subadditivity, Generalized Products of Random Matrices and Operations
Research,” SIAM Review, (March 1988), 69-86.

41.  G.  Birkhoff  and  J.  Lipson,  “Heterogeneous  Algebras,”  J.  Combinatorial  Theory  8 
(1970),  115-133. 

42.  G.X.  Ritter,  M.A.  Shrader-Frechette,  and  J.N.  Wilson,  “Image  Algebra:  A  Rigorous 
and  Translucent  Way  of  Expressing  All  Image  Processing  Operations,”  in  Proc.  of  the 
1987  SPIE  Tech,  Symp.  Southeast  on  Optics,  Elec.-Opt.,  and  Sensors,  Orlando,  FL 
(May  1987). 

43. G.X. Ritter and J.N. Wilson, “Image Algebra: A Unified Approach to Image Processing,”
in Proceedings of the SPIE Medical Imaging Conference, Newport Beach, CA
(February 1987).

44. P.D. Gader, “Necessary and Sufficient Conditions for the Existence of Local Matrix
Decompositions,” SIAM Journal on Matrix Analysis and Applications, (July 1988),
305-313.

45. G.X. Ritter and P.D. Gader, “Image Algebra Techniques for Parallel Image Processing,”
Journal of Parallel and Distributed Computing 4(5) (March 1987), 7-44.

46. A. Shimbel, “Structure in Communication Nets,” pp. 119-203 in Proc. Symp. on Information
Networks, Polytechnic Institute of Brooklyn (1954).

47.  B.  Giffler,  “Mathematical  Solution  of  Production  Planning  and  Scheduling  Problems,” 
IBM  ASDD  Tech.  Rep.  (1960). 

48.  V.  Peteanu,  “An  Algebra  of  the  Optimal  Path  in  Networks,”  Mathematica  9  (1967), 
335-342. 

49. C. Benzaken, “Structures Algébriques des Cheminements,” pp. 40-57 in Network and
Switching Theory, ed. Biorci, Academic Press (1968).

50. B. Carré, “An Algebra for Network Routing Problems,” J. Inst. Math. Appl. 7 (1971),
273-294.

51. R.C. Backhouse and B. Carré, “Regular Algebra Applied to Path-Finding Problems,” J.
Inst. Math. Appl. 15 (1975), 161-186.

52. G.X. Ritter, M.A. Shrader-Frechette, and P.D. Gader, “Image Algebra Tutorial, Version I,”
TR Image Algebra Project, F08635-84-C-0295, Eglin AFB, FL (1985).

53.  M.  Yoeli,  “A  Note  on  a  Generalization  of  Boolean  Matrix  Theory,”  Am.  Math.  Monthly 
68  (1961),  552-557. 

54. G. Matheron, Eléments pour une théorie des milieux poreux (1967).

55. J. Serra, “Introduction à la Morphologie Mathématique,” Booklet No. 3, Cahiers du
Centre de Morphologie Mathématique, Fontainebleau, France (1969).

56.  S.  Sternberg,  “Cellular  Computers  and  Biomedical  Image  Processing,”  in  Proc.  U.S.  - 
France  Seminar  on  Biomed.  Image  Processing,  Grenoble,  France  (1980). 




57. J. Serra, “Morphologie pour les fonctions ‘à peu près en tout ou rien’,” Technical
Report, Cahiers du Centre de Morphologie Mathématique, Fontainebleau, France
(1975).

58.  F.  Meyer,  “Iterative  image  transformation  for  an  automatic  screening  of  cervical 
smears,”  Journal  of  Histochem.  and  Cytochem.  27(1)  (1978),  128-135. 

59. S.R. Sternberg, “Esoteric Iterative Algorithms,” in II Internat’l Conf. on Image
Analysis and Processing, Brindisi, Italy (Nov. 1982).

60. G.X. Ritter, J.N. Wilson, and J.L. Davidson, “Image Algebra Application to Image
Measurement and Feature Extraction,” in Proc. of the 1989 SPIE OE/LASE 1989
Optics, Elec.-Optics, and Laser Appl. in Sci. and Eng., Los Angeles (Jan. 1989).

61. G.X. Ritter and Dong Li, “Template Decomposition and Image Algebra,” UF-CIS Technical
Report, Dept. of Comp. and Info. Sci., Univ. of Florida, Gainesville, FL (1989).

62.  J.B.  Fraleigh,  A  First  Course  in  Abstract  Algebra,  Addison-Wesley,  Reading,  MA 
(1968). 

63.  P.E.  Miller,  “An  Investigation  of  Boolean  Image  Neighborhood  Transformations,” 
Ph.D.  dissertation,  Ohio  State  University  (1978). 

64.  K.G.  Murty,  Linear  and  Combinatorial  Programming,  John  Wiley,  New  York  (1976). 

65. S. Peleg et al., “Multiple Resolution Texture Analysis and Classification,” TR,
Center for Automation Research, University of Maryland, College Park, MD (July
1983).

66.  B.B.  Mandelbrot,  The  Fractal  Geometry  of  Nature,  Freeman,  San  Francisco  (1983). 


BIOGRAPHICAL  SKETCH 


Jennifer  L.  Davidson  was  born  in  Dayton,  Ohio,  and  grew  up  in  the  Washington,  D.C. 
area.  She  received  the  B.A.  in  physics  from  Mount  Holyoke  College,  Massachusetts,  and  the 
M.S.  in  mathematics  from  the  University  of  Florida  in  1986.  Her  research  interests  include 
applied  mathematics,  image  processing,  and  computer  vision,  and  she  is  the  author  of  over  17 
research  papers  including  a  journal  publication. 

Ms.  Davidson  is  a  member  of  the  IEEE  Computer  Society,  Society  for  Photo-Optical 
Instrumentation  Engineers,  Society  of  Industrial  and  Applied  Mathematics,  Mathematical 
Association  of  America,  and  the  American  Mathematical  Society. 




I certify that I have read this study and that in my opinion it conforms to acceptable
standards of scholarly presentation and is fully adequate, in scope and quality, as a
dissertation for the degree of Doctor of Philosophy.


Gerhard X. Ritter, Chairman
Professor of Mathematics
Professor  of  Computer  and  Information  Science 


I certify that I have read this study and that in my opinion it conforms to acceptable
standards of scholarly presentation and is fully adequate, in scope and quality, as a
dissertation for the degree of Doctor of Philosophy.


Beverly  L.  Breen 
Professor  of  Mathematics 


I certify that I have read this study and that in my opinion it conforms to acceptable
standards of scholarly presentation and is fully adequate, in scope and quality, as a
dissertation for the degree of Doctor of Philosophy.

James B. Keesling
Professor of Mathematics


I certify that I have read this study and that in my opinion it conforms to acceptable
standards of scholarly presentation and is fully adequate, in scope and quality, as a
dissertation for the degree of Doctor of Philosophy.

Li-Chien Shen
Associate Professor of Mathematics


I certify that I have read this study and that in my opinion it conforms to acceptable
standards of scholarly presentation and is fully adequate, in scope and quality, as a
dissertation for the degree of Doctor of Philosophy.


David  C.  Wilson 
Professor  of  Mathematics 


APPENDIX  F 


DoD FELLOWSHIP FORMS


Universal  Energy  Systems,  Inc 


Date


UES  Project  789 
DOD Fellows


ADDRESS 


SUBJECT: Contract F49620-86-C-0127/SB5861-0436
         Proposed Subcontract S-789-DOD-000


Dear _:

Enclosed are an original and one copy of the subject subcontract agreement. The
total price is $17,500.

This Subcontract Agreement must be signed by an official of your organization
who is authorized to bind the University of _ in a legal contract.

Please sign and return the original agreement as soon as possible, but no later
than 10 days after the date of this letter. This agreement will be signed, and
a copy of the fully executed document will be returned to you for your files.

If  you  have  any  contractual  questions,  please  call  me.  For  technical  questions, 
please  contact  Mr.  Rodney  C.  Darrah,  Project  Manager. 

Sincerely, 

UNIVERSAL  ENERGY  SYSTEMS,  INC. 


Susan  K.  Espy 
Contract  Administrator 

xc:  DCASMA/Dayton  (M.  Hughes) 

UES Project Manager
UES  Suspense  (Ltr  Only) 
Fellow 


SKE/mt 

2329s 


4401  Dayton-Xenia  Rd.  •  Dayton,  Ohio  45432-1894  •  phone  513/426-6900  •  FAX  513/429-5413 

Cable  :  ENERGY  •  Telex:  288250 


SUBCONTRACT AGREEMENT

Page 1 of 7

Subcontract No.:      S-789-DOD-OCX5
Prime Contract No.:   F49620-86-C-0127 (SB5861-0436)

Certified For National Defense Use Under DPAS (15 CFR 350).

PRIORITY RATING: DO C-9

Universal Energy Systems, Inc.
4401 Dayton-Xenia Road
Dayton, OH 45432
Project Manager: Rod Darrah

Contract Type: FIXED PRICE

Subcontractor:

Total Amount: $17,500.00
OPTION I  - 22,000
OPTION II - 25,000

Subcontract Effective Date: 1 September 1989

Security Classification: U
Date of DD Form 254: N/A

TABLE OF CONTENTS

(X)  SEC  ITEM                                PAGE

PART I - SCHEDULE
 X    A   Contract Form                       2
 X    B   Supplies/Services & Prices          2-3
 X    C   Description/Specifications
 X    D   Packaging & Marking
 X    E   Inspection & Acceptance             3
 X    F   Deliveries or Performance
 X    G   Administration Data
 X    H   Special Provisions                  5-7

PART II - GENERAL PROVISIONS
 X    I   General Provisions                  7-8

PART III - LIST OF ATTACHMENTS
 X    J   Documents, Exhibits, Attachments    8

Subcontractor  agrees  to  furnish  and  deliver  all  items  or  perform  all 
the  services  set  forth  or  otherwise  identified  in  this  subcontract 
agreement  for  the  consideration  stated. 

SUBCONTRACTOR: _

Date

By: _ Signed: _
(Signature of Person Authorized to Sign)

Name & Title: _

UNIVERSAL ENERGY SYSTEMS, INC.:

Date

By: _ Signed: _
(Signature)

Name & Title: Rodney C. Darrah, Vice President, Scientific Services


SECTION  A 
CONTRACT  FORM 


This Subcontract Agreement is entered into as of the 1st day of September
1989,  between  Universal  Energy  Systems,  Inc.,  an  Ohio  corporation  with 
offices  in  Dayton,  Ohio,  (hereinafter  referred  to  as  UES)  and  the 
(University)  (hereinafter  referred  to  as  Subcontractor). 

Whereas  UES  has  entered  into  prime  contract  No.  F49620-86-C-0127 
(SB5861-0436) with the United States Government pursuant to which UES is
obligated  to  furnish  Services  and  Related  Personnel  to  conduct  special 
studies  involving  U.S.  Graduate  Students  pursuing  Doctoral  Degrees  in  Air 
Force  Research  Activities,  and 

Whereas  UES  desires  to  have  Subcontractor  perform  a  portion  of  the  work  and 
Subcontractor  desires  to  assume  the  obligation  to  perform  such  portion  of 
the  work,  subject  to  the  terms,  conditions,  and  provisions  of  this 
subcontract  agreement: 

Now  therefore,  in  consideration  of  the  foregoing  and  the  covenants  herein 
contained,  the  parties  agree  as  set  forth  herein. 

Work  Location.  Work  will  be  performed  at  the  (University)  in  support  of 
the  Department  of  Defense  National  Defense  Science  &  Engineering  Graduate 
Fellowship  Program  and  (Laboratory). 

SECTION  B 

SUPPLIES, SERVICES, AND PRICES

The following services shall be provided:

Educational and research facilities, academic guidance and coursework in
support of a Special Study in (Research). (Fellow) is assigned to the
(Laboratory) at (Air Force Base). (Mentor) is the Laboratory Advisor.
(Advisor), Department of _, is the Academic/Dissertation Advisor.

For services rendered, as herein requested, UES will pay compensation as
follows:

Fellow's Stipends:                      $14,000 ($10,500 Academic Year - $3,500 Summer Session)
In lieu of Fellow's Tuition and Fees:   $ 6,000 (First Year) (12 months)
Fellowship Processing Fee:              $ 1,000 (First Year) (12 months)
TOTAL                                   $21,000

Option I - For second year services rendered as herein requested, UES will
pay compensation as follows:

Fellow's Stipends:                      $15,000 ($11,250 Academic Year - $3,750 Summer Session)
In lieu of Fellow's Tuition and Fees:   $ 6,000 (Second Year) (12 months)
Fellowship Processing Fee:              $ 1,000 (Second Year) (12 months)
TOTAL                                   $22,000

Option II - For third year services rendered as herein requested, UES
shall pay compensation as follows:

Fellow's Stipends:                      $16,000 ($12,000 Academic Year - $4,000 Summer Session)
In lieu of Fellow's Tuition and Fees:   $ 6,000 (Third Year) (12 months)
Fellowship Processing Fee:              $ 1,000 (Third Year) (12 months)
TOTAL                                   $23,000
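
The arithmetic behind the three yearly TOTAL lines of the Section B schedule can be checked with a short sketch. It is illustrative only: the dollar figures are copied from the schedule, while the dictionary layout and the names SCHEDULE, TUITION_IN_LIEU, PROCESSING_FEE, and yearly_total are assumptions introduced here and are not part of the agreement.

```python
# Illustrative check of the Section B compensation schedule.
# Figures come from the schedule; all names below are hypothetical.
SCHEDULE = {
    "first year": {"academic_year": 10_500, "summer": 3_500},
    "second year (Option I)": {"academic_year": 11_250, "summer": 3_750},
    "third year (Option II)": {"academic_year": 12_000, "summer": 4_000},
}
TUITION_IN_LIEU = 6_000  # paid in lieu of tuition and fees, per 12-month year
PROCESSING_FEE = 1_000   # fellowship processing fee, per year


def yearly_total(stipend_parts):
    """Return (stipend, total) for one fellowship year."""
    stipend = stipend_parts["academic_year"] + stipend_parts["summer"]
    return stipend, stipend + TUITION_IN_LIEU + PROCESSING_FEE


TOTALS = {year: yearly_total(parts) for year, parts in SCHEDULE.items()}

if __name__ == "__main__":
    for year, (stipend, total) in TOTALS.items():
        print(f"{year}: stipend ${stipend:,}, total ${total:,}")
```

Under these assumptions, each yearly total is the sum of the academic-year stipend, the summer-session stipend, the payment in lieu of tuition and fees, and the processing fee.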


SECTION  C 

DESCRIPTION/SPECIFICATIONS 


The subcontractor as an independent subcontractor, and not as an agent of
the  contractor,  shall  perform  research  or  services  as  set  forth  in  the 
Fellow's  letter,  "National  Defense  Science  and  Engineering  Graduate 
Fellowship  Conditions  Form,  Additional  Opportunities  Under  the  Air  Force 
Laboratory  Sponsorship  and  the  Air  Force  Laboratory  Sponsorship  Response 
Form,"  dated  _ ,  Attachment  1  hereto. 

SECTION  D 

PACKAGING & MARKING


N/A 


SECTION  E 

INSPECTION  &  ACCEPTANCE 
N/A 

SECTION F

DELIVERABLES AND PERFORMANCE PERIOD

Period  of  performance  is  from  1  September  1989  through  31  August  1990. 
This  Subcontract  Agreement  shall  terminate  at  12:01  a.m.  on  1  September 
1990  unless  extended  by  written  agreement.  The  first  year  of  the 
fellowship  tenure  is  to  be  completed  on  or  before  31  August  1990. 


Deliverables  are  as  follows: 


a.  Statement  of  Progress:  To  be  submitted  by  the  Fellow's  Academic 
Advisor  through  the  Subcontractor  at  the  completion  of  each  academic 
term.  (Forms  at  Attachment  2  hereto). 

b. Publications Prepared By the Fellow: Two (2) copies of any
publications prepared by the Fellow to be submitted by the Fellow
through the Subcontractor.

c. Dissertation: Complete copy of dissertation covering area of
(Research) to be submitted by the Fellow through the Subcontractor.

d. Transcripts: To be submitted by the Fellow through the Subcontractor
at the completion of each academic term.


All deliverables shall be delivered to the UES office located at
4401 Dayton-Xenia Road, Dayton, Ohio, 45432, ATTN: Mr. Rodney C. Darrah.

SECTION  G 

ADMINISTRATIVE  DATA 


PAYMENT:

a. Initial payment for the Academic Year will be made by UES to the
Subcontractor within 45 days after the date of UES's signature on page
1 of this Subcontract. This initial sum, specified in Section H,
paragraph 9, includes payment in lieu of tuition for the Academic Year
and summer term 1989-1990 (12 months), stipend for Academic Year and
summer term 1989-1990 (12 months), and $1,000.00 for the Fellowship
Processing Fee.

b. Payment for each subsequent summer term and Academic Year will be
made by UES to the Subcontractor within 45 days after execution of
the modification to fund. Payments will include stipend for the
period.


SECTION  H 
SPECIAL  PROVISIONS 

1.  In  the  event  Subcontractor  should  violate  the  terms  of  this  subcontract 
agreement  UES,  at  its  option,  may  terminate  this  subcontract  agreement  upon 
verbal  notification  to  the  subcontractor  which  will  be  confirmed  in 
writing.  UES  shall  be  under  no  obligation  except  to  pay  Subcontractor  such 
compensation  as  Subcontractor  may  be  entitled  to  receive  up  to  the  time  of 
such  termination. 

2.  The  Fellow  is  expected  to  complete  all  doctoral  requirements  within  three 
(3)  years  from  1  September  1989. 

3.  As  set  forth  in  Section  B.2  above  payment  of  $6,000  is  made  in  lieu  of 
yearly  (12  month)  tuition  and  fees. 

4.  The  normal  tenure  of  this  fellowship  is  twelve  (12)  months  each  fellowship 
year.  The  twelve  (12)  month  tenure  may  be  reduced  to  no  less  than  nine  (9) 
months  with  forfeiture  of  stipend  for  the  remaining  three  (3)  months  of  the 
fellowship  year.  In  the  event  this  occurs  this  subcontract  will  be 
modified  to  decrease  the  stipend  set  forth  in  Section  B.2  above. 

5.  The  availability  of  funding  for  years  two  and  three  is  contingent  on  a) 
certification  to  UES  by  the  (University)  that  satisfactory  academic 
progress toward a Ph.D. in (Research) is being made by (Fellow), and b) the
availability  of  appropriated  Air  Force  funds  for  continued  support.  If 
this  Fellowship  is  continued  beyond  the  first  year  this  Subcontract  will 
be  amended  accordingly.  If  amended,  the  second  year  stipend  will  be 
$15,000  ($11,250  Academic  year  -  $3,750  for  summer  session)  and  the  third 
year  stipend  will  be  $16,000  ($12,000  Academic  year  -  $4,000  summer 
session).  Payment  for  tuition  and  fees  will  remain  unchanged  for  second 
and  third  year. 

6.  In  the  event  the  Government  should  desire  significant  changes  which  would 
affect  price,  delivery  schedule,  or  terms  and  provisions  in  the  overall 
project's  work  content  or  scope,  this  subcontract  may  be  amended 
appropriately  in  accordance  with  Section  H,  Provision  Number  14  of  this 
agreement. 

7.  (FFP  Contract) 

The  contract  price  is  $17,500.00. 

8. OPTION PROVISION: UES is hereby granted the right to obtain the
performance of the work described in Section B, SUPPLIES, SERVICES and
PRICES. UES may require the subcontractor to perform Options at the fixed
price set forth in Section B by issuing a unilateral subcontract
modification to the subcontractor on or before 1 September 1990 for Option
I and 1 September 1991 for Option II.


9. LIMITATION OF UES OBLIGATION: Of the total subcontract price, the sum of
$17,500.00  is  presently  available  for  payment  and  is  allotted  to  this 
subcontract.  It  is  anticipated  that  from  time  to  time  additional  funds 
will  be  allotted  to  this  subcontract  until  the  total  price  of  this 
subcontract  is  allotted.  It  is  contemplated  that  the  funds  presently 
allotted  to  this  subcontract  will  cover  the  work  to  be  performed  through 
31  August  1990. 

10. Choice of Law: Irrespective of the place of performance, this Subcontract
will be construed and interpreted according to the federal common law of
government contracts enunciated and applied by federal judicial bodies,
boards of contract appeals, and quasi-judicial agencies of the federal
government.  To  the  extent  that  the  federal  common  law  of  government 
contracts  is  not  dispositive,  the  laws  of  the  state  from  which  this 
subcontract  is  issued  shall  apply. 

11. DISPUTES: Either party may litigate any dispute arising under or relating
to  this  Subcontract  before  any  court  of  competent  jurisdiction.  Pending 
resolution  of  any  such  dispute  by  settlement  or  by  final  judgment,  the 
parties  shall  proceed  diligently  with  performance.  Subcontractor's 
performance  shall  be  in  accordance  with  UES's  written  instructions.  All 
references  to  disputes  procedures  in  Government  clauses  incorporated  by 
reference  shall  be  deemed  to  be  superseded  by  this  clause. 

12. ENTIRE CONTRACT: This Subcontract and any documents specifically
incorporated  herein  constitute  the  entire  agreement  between  the  parties. 
This  written  agreement  supersedes  all  other  prior  agreements,  written  or 
oral,  between  the  parties  related  to  the  subject  matter  hereof  unless 
specifically  otherwise  provided  herein.  The  terms  and  obligations  created 
hereby  shall  be  changed  only  in  writing  as  a  modification  to  this 
agreement,  duly  executed  by  the  parties  hereto. 

13. ACKNOWLEDGEMENT OF SPONSORSHIP


a.  The  Subcontractor  agrees  that  in  the  release  of  information  relating 
to  this  contract  a  copy  will  be  provided  to  UES  and  such  release  shall 
include  a  statement  to  the  effect  that  the  project  or  effort  depicted  was 
or  is  sponsored  by  the  agency  set  forth  below. 




AIR  FORCE  OFFICE  OF  SCIENTIFIC  RESEARCH 
BOLLING AFB DC

b. For the purpose of this clause, "information" includes but is not
limited  to,  news  releases,  articles,  manuscripts,  brochures,  advertisement, 
still  and  motion  pictures,  speeches,  trade  association  meetings,  symposia, 
etc. 

14. Amendments to this subcontract agreement may be negotiated between UES and
Subcontractor. Such amendments must be in writing, and upon execution by
both parties, will become a part of this subcontract agreement and will
be subject to all other applicable terms and conditions of this subcontract
agreement.


SECTION  I 

GENERAL  PROVISIONS 

1.  Subcontractor  agrees  to  perform  the  services  in  accordance  with  the  following 
Federal  Acquisition  Regulations  (FAR)  clauses,  which  are  incorporated  herein 
by reference with the same force and effect as though herein set forth in
full. The clauses applicable to this Subcontract Agreement are limited to
only those which contain a specific subcontract/purchase order flow down
requirement as may be contained in each particular clause. Except with
respect to "Audit-Negotiation" and "Examination of Records by Comptroller
General," the Special Provisions and FAR clauses, including Special Clauses
in  full  text,  shall  be  deemed  to  be  modified  as  appropriate  to  substitute  the 
word  "UES"  for  "Government"  or  "Contracting  Officer,"  substitute  the  word 
"Subcontractor"  for  "Contractor,"  and  substitute  the  word  "Subcontract"  for 
"Contract." 


Federal Acquisition Regulation Clauses

FAR Paragraph No.   Clause Title                                         Date

52.212-8            Defense Priority and Allocation Requirements         May 1986
52.215-1            Examination of Records by Comptroller General        April 1984
52.215-2            Audit-Negotiation                                    April 1984
52.222-4            Contract Work Hours and Safety Standards Act -       March 1986
                    Overtime Compensation
52.222-26           Equal Opportunity                                    April 1984
52.222-35           Affirmative Action for Special Disabled and          April 1984
                    Vietnam Era Veterans
52.222-36           Affirmative Action for Handicapped Workers           April 1984
52.223-2            Clean Air and Water                                  April 1984
52.227-1            Authorization and Consent Alternate I (April 1984)   April 1984
52.227-2            Notice and Assistance Regarding Patent and           April 1984
                    Copyright Infringement
52.227-11           Patent Rights - Retention by the Contractor          April 1984
                    (Short Form)
52.243-1            Changes-Fixed Price                                  April 1984


SECTION J

DOCUMENTS, EXHIBITS & ATTACHMENTS

Attachment Number   Description                Date

1.                  Notification Letter        July 1989
2.                  Progress Certification     Undated




Universal Energy Systems, Inc.


17  July  1989 


Dear  _ : 

Congratulations on being selected to participate in the Department of Defense
funded National Defense Science and Engineering Graduate (NDSEG) Fellowship
Program, under the sponsorship of the Air Force.

Universal Energy Systems, Inc. (UES) has been selected by the Air Force Office
of Scientific Research (AFOSR) to administer the Academic Year of your
fellowship. UES is currently in the process of preparing a subcontract with
_ for your tenure on the NDSEG fellowship. You will be kept informed as
this progresses. We will establish the fellowship with _ so that
your stipend will be paid directly to you from the university. Your obligations
under this program are to make satisfactory progress each year toward your PhD,
provide input to UES at the completion of each academic year, and provide the
Air Force with a copy of your thesis at the completion of your PhD.

I have enclosed a Fellowship Conditions Form; please sign and return it to UES
at your earliest convenience.

I  have  also  attached  information  on  additional  opportunities  that  AFOSR  is 
offering  to  the  NDSEG  fellows.  Please  review  this  information  and  notify  UES 
of  your  interest.  Since  an  acceptance  of  the  opportunity  will  increase  your 
yearly  stipend,  it  affects  our  negotiations  with  your  university.  Your 
assistance  in  notifying  UES  as  soon  as  possible  will  greatly  help  us  with  the 
negotiations  with  your  university. 

If  you  have  any  questions  regarding  these  matters,  please  do  not  hesitate  to 
contact the undersigned, or Ms. Sue Espy (Program Administrator) at
1-800-533-7532 or in Ohio 513-426-6900.

Congratulations, and good luck on your continued efforts in graduate school.

Sincerely,

UNIVERSAL ENERGY SYSTEMS, INC.


Rodney C. Darrah
Program Director

xc: Rod Darrah
UES  contracts 
Lt.  Col.  Claude  Cavender 
for  the  University  Subcontract 



NATIONAL DEFENSE SCIENCE AND ENGINEERING
GRADUATE FELLOWSHIP
CONDITIONS FORM


I understand the following obligations and conditions concerning my
Department of Defense National Defense Science and Engineering Graduate
Fellowship.

1. Conditions of my fellowship for each fellowship year are contingent on my
satisfactory progress toward a PhD in _.

2.  I  will  provide  Universal  Energy  Systems,  Inc.  (UES)  with  a  statement  of 
progress  each  academic  year. 

3.  I  will  report  any  significant  results,  awards,  honors,  etc.  to  UES. 

4.  In  the  event  that  I  must  withdraw  from  the  fellowship  program,  I  will  provide 
UES  with  a  statement  of  the  reasons  for  my  withdrawal  and  a  statement  of  my 
future  employment  and  educational  plans. 

5.  I  will  provide  the  Air  Force  with  a  copy  of  my  PhD  thesis  upon  the  completion 
of  my  PhD  work. 


Signature 


Date 




Certification Needed for Each Academic Term
CERTIFICATION OF ACADEMIC PROGRESS


Fellow: _

University:

Semester/Academic Term:

Subcontract: S-789-DoD-


Fellow  to  complete 

1. Courses - Give description of courses and grades received. (Attach sheet if
extra space is needed.)


2. Give a description of research and progress toward dissertation. (Attach
sheets if extra space is needed.)


3. Give a brief statement of your involvement with the Laboratory _ and
_. Also list any items of interest such as academic awards, publications,
other information that can be used for NDSEG Fellowship Program.


"I  certify  that  all  information  stated  is  correct  and  complete." 


Signature/Fellowship  Recipient 


TYPED  NAME/FELLOWSHIP  RECIPIENT 




CERTIFICATION  OF  ACADEMIC  PROGRESS 


"I certify that _ is making satisfactory academic progress toward
a Ph.D. in the area of _ for the _ 19__ academic term."


Signature/Advising  Professor 


TYPED  NAME/TITLE  OF  ADVISING  PROFESSOR 




Universal  Energy  Systems,  Inc. 


11  October  1989 


Student's  Name  &  Address 


Dear _:

The laboratory and mentor assignment have been completed. You have been assigned
to the _. Information on your mentor is listed below:

MENTOR NAME    Mentor's Address    Mentor's Phone


You  may  contact  your  mentor  about  the  program  involvement  of  the  laboratory. 
Your  mentor  may  be  contacting  you  as  well. 

As a reminder, you are eligible for a three day visit to the laboratory to
discuss your research efforts with your mentor. If you wish to take advantage
of this visit, arrangements must be made with UES. Contact Sue Espy at the UES
office. All travel costs and a per diem allowance for the three days will be
provided by UES. In addition, you are eligible to spend 10 to 14 weeks during
the summers doing research at the laboratory. Your stipend will be paid to you
by UES during the time that you are at the laboratory. All travel costs for the
round trip to the lab for the summer work will be reimbursed by UES. For the
time that you are at the lab, UES will provide an expense allowance.
Arrangements for spending the summers at the laboratory need to be coordinated
with your mentor.


If  you  have  any  questions  concerning  the  mentor  assignment,  please  do  not 
hesitate to contact us at UES.

Sincerely, 

UNIVERSAL ENERGY SYSTEMS, INC.


Rodney C. Darrah
Program Director

xc: Chief Scientist
    Mentor
    Focal Point

r:\wp\ndseg\mentor.ltr




